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NUCLEIC ACIDS AND PROTEINS 
RELATED TO ALZHEIMER'S DISEASE, 
^ AND USES THEREFOR 

Field of the Invention 

The present invention relates generally to the field of neurological and 
physiological dysfunctions associated with Alzheimer's Disease. Mok particularly, 

10 the invention is concerned with the identification, isolation and cloning of genes 
which are associated with Alzheimer's Disease, as well as their corresponding 
transcripts and protein products. The present invention also relates to methods for 
detecting and diagnosing carriers of nomial and mutant alleles of these genes, to 
methods for detecting and diagnosing Alzheimer's Disease, to methods of identifying 

15 other genes and proteins related to, or interacting with, the genes and proteins of the 
invention, to methods of screening for potential therapeutics for Alzheimer's Disease, 
to methods of treatment for Alzheimer's Disease, and to cell lines and animal models' 
useful in screening for and evaluating potentially useful therapies for Alzheimer's 
Disease. 

Background of the TnvCT^ Hnn 
Alzheimer's Disease (AD) is a degenerative disorder of the human central 
nervous system characterized by progressive memory impairment and cognitive and 
intellectual decline during mid to late adult life (Katzman, 1986). The disease is 
accompanied by a constellation of neuro-pathologic features principal amongst which 
25 arc the presence of extracellular amyloid or senile plaques, and neurofibrillary tangles 
in neurons. The etiology of this disease is complex, although in some families it 
appears to be inherited as an autosomal dominant trait. Linkage studies have 
identified three genes associated with the development of AD: P-amyloid precursor 
protein (APP)(Chartier-Harlinetal., 1991; Goate et al.. 1991; Murrell et al.. 1991- 
30 Karlinsky et al., 1992; Mullan et al., 1992). presenilin-1 (PS-1) (Sherrington, 1995), 
and preseniIin-2 (PS-2) (Rogaev. 1995, and Levy-Lahad. 1995). 

The presenilins are multi-spaniing membrane proteins which were described 
in substantial detail in POT Publication WO96/34099, the entire disclosure of which 
is incoiporated herein by reference. Although the functions of the presenilins are 
35 unknown, a number of autosomal dominant presenilin mutations have been identified 

SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 PCT/CA97/00051 



-2- 



which are strongly associated with the development of early-onset, aggressive, 
Familial Alzheimer's Disease (FAD). 

The present disclosure describes the identification, isolation, sequencing and 
characterization of several human genes which interact with the presenilins, mutations 
5 in which may lead to FAD. These presenilin-interacting protein genes may be 
involved in the pathways which, when affected by mutant presenilins, lead to the 
development of Alzheimer's Disease. In addition, mutations in the presenilin- 
interacting protein genes, even in the absence of defects in the presenilins, may be 
causative of Alzheimer's Disease. 

10 Summary of the Invention 

The present invention is based, in part, upon the identification, isolation, 
sequencing and characterization of several human genes, referred to herein as 
"presenilin-interacting protein genes" or "PS-interacting protein genes." The products 
of these genes are believed to interact in vivo with the human prcsenilin-1 proteins 

15 and, therefore, are implicated in the biochemical pathways which are affected in 

Alzheimer's Disease. Each of these genes, therefore, presents a new therapeutic target 
for the treatment of Alzheimer's Disease. In addition, PS-interacting protein nucleic 
acids, PS-interacting proteins and peptides, antibodies to the PS-interacting proteins, 
cells transformed with PS-interacting protein nucleic acids, and transgenic animals 

20 altered with PS-interacting protein nucleic acids, all possess various utilities, as 

described herein, for the diagnosis, therapy and continued investigation of Alzheimer's 
Disease and related disorders. 

Thus, it is one object of the invention to provide isolated nucleic acids 
encoding at least a PS-interacting domain of a PS-interacting protein. These PS- 
25 interacting proteins include mammalian S5a subunits of the 26S proteasome, the 
GT24 protein, the p0071 protein, the Rabl 1 protein, the retinoid X receptor-p, the 
cytoplasmic ch^eronin, and several sequences identified herein as clones Y2H35, 
Y2H171, and Y2H41. Preferred nucleotide and amino acid sequences are provided 
herein. It is another object of the invention to provide probes and primers for these 
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PS-interacting protein genes, and to provide nucleic acids which encode small 
antigenic determinants of these genes. Therefore, preferred embodiments include 
sequences of at least 10, 15 or 20 consecutive nucleotides selected from the disclosed 
sequences. 

5 Using the nucleic acid sequences and antibodies disclosed and enabled 

herein, methods for identifying allelic variants or heterospecific homologues of a 
human PS-interacting protein and gene are provided. The methods may be practiced 
using nucleic acid hybridization or amplification techniques, immunochemical 
techniques, or any other technique known in the art. The allelic variants may include 

10 other normal human alleles as well as mutant alleles of the PS-interacting protein 
genes which may be causative of Alzheimer's Disease. The heterospecific 
homologues may be from other mammalian species, such as mice, rats, dogs, cats or 
non-human primates, or may be from invertebrate species, such as Drosoohila or 
eiegans. Thus, it is another object of the invention to provide nucleic acids which 

15 encode allelic or heterospecific variants of the disclosed sequences, as well as the 
allelic or heterospecific proteins encoded by them. 

The it another object of the invention to provide vectors, and particularly 
expression vectors, which include any of the above-described nucleic acids. It is a 
further object of the invention to provide vectors in which PS-interacting protein 
nucleic acid sequences are operably joined to exogenous regulatory regions to produce 
altered patterns of expression, or to exogenous coding regions to produce fiision 
proteins. Conversely, it is another object to provide nucleic acids in which PS- 
interacting protein regulatory regions are operably joined to exogenous coding 
regions, including standard marker genes, to produce constructs in which the 
25 regulation of PS-interacting protein genes may be studied and used in assays for 
therapeutics. 

It is another object of the invention to provide host cells and transgenic 
animals which have been transformed with any of the above-described nucleic acids 



20 
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of the invention. The host cells may be prokaryotic or eukaryotic cells and, in 
particular, may be gametes, zygotes, fetal cells, or stem cells useful in producing 
transgenic animal models. 

In particularly preferred embodiments, the present invention provides a 
non-human animal model for Alzheimer's Disease, in which the genome of the 
animal, or an ancestor thereof, has been modified by at least one recombinant 
construct which has introduced one of the following modifications: (1) insertion of 
nucleotide sequences encoding at least a functional domain of aheterospecific normal 
PS-interacting protein, (2) insertion of nucleotide sequences encoding at least a 
functional domain of a heterospecific mutant PS-interacting protein, (3) insertion of 
nucleotide sequences encoding at least a functional domain of a conspecific 
homologue of a heterospecific mutant PS-interacting protein, and (4) inactivation of 
an endogenous PS-interacting protein gene. Preferred transgenic animal models are 
rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non- 
human primates, but invertebrates are also contemplated for certain utilities. 

It is another object of the invention to provide methods for producing at 
least a functional domain of a PS-interacting protein using the nucleic acids of the 
invention. In addition, the present invention also provides substantially pure 
preparations of such proteins, including short peptide sequences for used as 
20 immunogens. Thus, the invention provides peptides comprising at least 10 or 15 

consecutive amino acid residues from the disclosed and othenvise enabled sequences. 
The invention further provides substantially pure preparations of peptides which 
comprise at least a PS-interacting domain of a PS-interacting protein, as well as 
substantially pure preparations of the entire proteins. 

25 Using the substantially pure peptides and proteins enabled herein, the 

invention also provides methods for producing antibodies which selectively bind to a 
PS-interacting protein, as well as cell lines which produce these antibodies. 



10 
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Another object of the present invention is to provide methods of 
identifying compounds which may have utility in the treatment of Alzheimer's 
Disease and related disorders. These methods include methods for identifying 
compounds which can modulate the expression of a PS-interacting protein gene, 
methods for identifying compounds which can selectively bind to a PS-interacting 
protein, and methods of identifying compounds which can modulate activity of a PS- 
interacting protein. These methods may be conducted in vitro or in vivo , and may 
employ the transformed cell lines and transgenic animal models of the invention. The 
methods also may be part of a clinical trial in which compounds identified by the 
methods of the invention are further tested in human subjects. 

It is another object of the invention to provide methods of diagnosing or 
screening for inherited forms of Alzheimer's Disease by determining if a subject bear, 
a mutant PS-interacting protein gene. Mutant PS-interacting genes may be detected 
by assays including direct nucleotide sequencing, probe specific hybridization, 
restriction enzyme digest and mapping, PCR mapping, ligase-mediated PGR 
detection. RNase protection, electrophoretic mobility shift detection, or chemical 
mismatch cleavage. Alternatively, mutant forms of a PS-interacting protein may be 
detected by assays including immunoassays, protease assays, or electrophoretic 
mobility assays. 

It is also an object of the invention to provide phannaceutical preparations 
which may be used in the treatment of Alzheimer's Disease and related disorders 
which result fiom abeiration in biochemical pathways involving the PS-interacting 
proteins disclosed and enabled herein. Thus, the present invention also provides 
phannaceutical preparations comprising a substantially pure PS-inteiacting protein, an 
expression vector operably encoding a PS -interacting protein, an expression vector 
operably encoding a PS-interacting protein antisense sequence, an antibody which 
selectively binds to a mutant PS-interacting protein, or an antigenic determinant of a 
mutant PS-interacting protein. These phamiaceutical preparations may be used to 
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treat a patient bearing a mutant PS-interacting protein gene which is causative of 
Alzheimer's Disease or related disorders. 

These an other objects of the present invention are described more fully in 
the following specification and appended claims. 

^ Detailed Description of the Invention 

I. Definitions 

In order to facilitate review of the various embodiments of the invention, 
and an understanding of the various elements and constituents used in making and 
using the invention, the following definitions are provided for particular terms used in 
10 the description and appended claims: 

Presenilin. As used without further modification herein, the terms 
"presenilin" or "presenilins" mean the presenilin- 1 (PSl) and/or the presenilin-2 (PS2) 
genes/proteins. In particular, the unmodified terms "presenilin" or **presenilins" refer 
to the mammalian PSl and/or PS2 genes/proteins and, preferably, the human PSl 
15 and/or PS2 genes/proteins as described and disclosed in PCX Publication 
WO96/34099. 

Normal. As used herein with respect to genes, the term "normal" refers to 
a gene which encodes a normal protein. As used herein with respect to proteins, the 
temi "normal" means a protein which performs its usual or normal physiological role 

20 and which is not associated with, or causative of, a pathogenic condition or state. 

Therefore, as used herein, the term "normal" is essentially synonymous with the usual 
meaning of the phrase "wild type." For any given gene, or corresponding protein, a 
multiplicity of normal allelic variants may exist, none of which is associated with the 
development of a pathogenic condition or state. Such normal allelic variants include, 

25 but are not limited to, variants in which one or more nucleotide substitutions do not 
result in a change in the encoded amino acid sequence. 

Mutant. As used herein with respect to genes, the term "mutant" refers to 
a gene which encodes a mutant protein. As used herein with respect to proteins, the 
term "mutant" means a protein which does not perform its usual or normal 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 

PCT/CA97/O0OS1 



-7- 



15 



20 



25 



30 



physiological role and which is associated with, or causative of. a pathogenic 
condition or state. Therefore, as used herein, the term "mutant" is essentially 
synonymous with the tenns "dysfunctional," "pathogenic." "disease-causing." and 
"deleterious." With respect to the presenilin and presenilin-interacting protein genes 
5 and proteins of the present invention, the term "mutant" refers to genes/proteins 
bearing one or more nucleotide/amino acid substitutions, insertions and/or deletions 
which typically lead to the development of the symptoms of Alzheimer's Disease 
and/or other relevant inheritable phenotypes (e.g. cerebral hemorrhage, mental 
retardation, schizophrenia, psychosis, and depression) when expressed in humans. 
This definition is understood to include the various mutations that naturally exist, 
including but not limited to those disclosed herein, as well as synthetic or recombinant 
mutations produced by human intervention. The term "mutant." as applied to these 
genes, is not intended to embrace sequence variants which, due to the degeneracy of 
the genetic code, encode proteins identical to the normal sequences disclosed or 
otherwise enabled herein; nor is it intended to embrace sequence variants which, 
although they encode different proteins, encode proteins which are functionally ' 
equivalent to normal proteins. 

Substantially ptin- As used herein with respect to proteins (including 
antibodies) or other preparaUons. the term "substantially pure" means that the 
preparation is essentially free of other substances to an extent practical and 
appropriate for its intended use. In particular, a protein preparation is substantially 
pure if it is sufficiently free from other biological constituents so as to be useful in for 
example, generating antibodies, sequencing, or producing pharmaceutical 
preparations. By techniques well known in the art. substantially pure proteins or 
peptides may be produced in Ught of the nucleic acid and amino acid sequences 
disclosed herein. In particular, in light of the nucleic acid and amino acid sequences 
disclosed herein, one of ordinary skill in the art may. by application or serial 
application of well-known methods including HPLC or immuno-affinity 
chromatography or electrophoretic separation, obtain proteins or peptides of any 
generally feasible purity. Preferably, but not necessarily, "substantially pure" 
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preparations include at least 60% by weight (dry weight) the compound of interest. 
More preferably the preparation is at least 75% or 90%, and most preferably at least 
99%, by weight the compound of interest. Purity can be measured by any appropriate 
method, e.g., column chromatography, gel electrophoresis, or HPLC analysis. With 
5 respect to proteins, including antibodies, if a preparation includes two or more 

different compounds of interest (e.g., two or more different antibodies, immunogens, 
functional domains, or other polypeptides of the invention), a "substantially pure" 
preparation is preferably one in which the total weight (dry weight) of all the 
compounds of interest is at least 60% of the total dry weight. Similarly, for such 

10 preparations containing two or more compounds of interest, it is preferred that the 
total weight of the compounds of interest be at least 75%, more preferably at least 
90%, and most preferably at least 99%, of the total dry weight of the preparation. 
Finally, in the event that the protein of interest is mixed with one or more other 
proteins (e.g., serum albumin) or compounds (e.g., diluents, excipients, salts, 

15 polysaccharides, sugars, lipids) for purposes of administration, stability, storage, and 
the like, such other proteins or compounds may be ignored in calculation of the purity 
of the preparation. 

Isolated nucleic acid. As used herein, an "isolated nucleic acid" is a 
ribonucleic acid, deoxyribonucleic acid, or nucleic acid analog comprising a 

20 polynucleotide sequence that is isolated or separate from sequences that are 

inmiediately contiguous (one on the 5' end and one on the 3* end) in the naturally 
occurring genome of the organism from which it is derived. The term therefore 
includes, for example, a recombinant nucleic acid which is incorporated into a vector, 
into an autonomously replicating plasmid or virus, or into the genomic DNA of a 

25 prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA or a 
genomic DNA fragment produced by PCR or restriction endonuclease treatment) 
independent of other sequences. It also includes a recombinant DNA which is part of 
a hybrid gene encoding additional polypeptide sequences and/or including exogenous 
regulatory elements. 
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Substantially identical segufttirp As used herein, a "substantially 
identical" amino acid sequence is an amino acid sequence which differs only by 
conservative amino acid substitutions, for example, substitution of one amino acid for 
another of the same class (e.g., valine for glycine, arginine for lysine, etc.) or by one 
or more non-conservative substitutions, deletions, or insertions located at positions of 
the amino acid sequence which do not destroy the fimction of the protein (assayed, 
e.g., as described herein). Preferably, such a sequence is at least 85%, more 
preferably 90%, and most preferably 95% identical at the amino acid level to the 
sequence of the protein or peptide to which it is being compared. For nucleic acids, 
the length of comparison sequences will generally be at least 50 nucleotides, 
preferably at least 60 nucleotides, more preferably at least 75 nucleotides, and most 
preferably 1 10 nucleotides. A "substantially identical" nucleic acid sequence codes 
for a substantially identical amino acid sequence as defined above. 

Transforpiedcell. As used herein, a "transformed cell" is a cell into which 
(or into an ancestor of which) has been introduced, by means of recombinant DNA 
techniques, a nucleic acid molecule of interest. The nucleic acid of interest will 
typically encode a peptide or protein. The transformed cell may express the sequence 
of interest or may be used only to propagate the sequence. The term "transformed" 
may be used herein to embrace any method of introducing exogenous nucleic acids 
including, but not limited to, transformation, transfection, electroporation, 
microinjection, viral-mediated transfection, and the like. 

Operably joined As used herein, a coding sequence and a regulatory region are 
said to be "operably joined" when they are covalently linked in such a way as to place 
the expression or transcription of the coding sequence under the influence or control 
of the regulatory region. If it is desired that the coding sequences be translated into a 
functional protein, two DNA sequences are said to be operably joined if induction of 
promoter function results in the transcription of the coding sequence and if the nature 
of the linkage between the two DNA sequences does not (1) result in the introduction 
of a frame-shift mutation, (2) interfere with the ability of the regulatory region to 
direct the transcription of the coding sequences, or (3) interfere with the ability of the 
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corresponding RNA transcript to be translated into a protein. Thus, a regulatory 
region would be operably joined to a coding sequence if the regulatory region were 
capable of effecting transcription of that DNA sequence such that the resulting 
transcript might be translated into the desired protein or polypeptide. 
5 Stringent hybridization conditions. Stringent hybridization conditions is a term 

of art understood by those of ordinary skill in the art. For any given nucleic acid 
sequence, stringent hybridization conditions are those conditions of temperature, 
chaotrophic acids, buffer, and ionic strength which will permit hybridization of that 
nucleic acid sequence to its complementary sequence and not to substantially different 

10 sequences. The exact conditions which constitute "stringent" conditions, depend upon 
the nature of the nucleic acid sequence, the length of the sequence, and the frequency 
of occurrence of subsets of that sequence within other non-identical sequences. By 
varying hybridization conditions from a level of stringency at which non-specific 
hybridization occurs to a level at which only specific hybridization is observed, one of 

15 ordinary skill in the art can, without undue experimentation, determine conditions 
which will allow a given sequence to hybridize only with complementary sequences. 
Suitable ranges of such stringency conditions are described in Krause and Aaronson 
(1991). Hybridization conditions, depending upon the length and commonality of a 
sequence, may include temperatures of 20°C-65°C and ionic strengths from 5x to O.lx 

20 SSC. Highly stringent hybridization conditions may include temperatures as low as 
40-42**C (when denaturants such as forraamide are included) or up to eO-eS^'C in ionic 
strengths as low as O.lx SSC. These ranges, however, are only illustrative and, 
depending upon the nature of the target sequence, and possible future technological 
developments, may be more stringent than necessary. Less than stringent conditions 

25 are employed to isolate nucleic acid sequences which are substantially similar, allelic 
or homologous to any given sequence. 

Selectively binds, As used herein with respect to antibodies, an antibody 
is said to "selectively bind" to a target if the antibody recognizes and binds the target 
of interest but does not substantially recognize and bind other molecules in a sample, 

30 e.g., a biological sample, which includes the target of interest. That is, the antibody 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



-11- 



10 



15 



20 



25 



30 



must bind to its target with sufficient specificity so as to distinguish the target from 
essentially all of molecules which would reasonably be present in a biological sample 
including the target. 

The Presenilins and Pre senilin-Interactine Proteins 

The present invention is based, in part, upon the discovery of a family of 
mammalian genes which, when mutated, are associated with the development of 
Alzheimer's Disease. The discovery of these genes, designated presenilin-1 (PSl) and 
presenilin-2 (PS2), as well as the characterization of these genes, their protein 
products, mutants, invertebrate homologues, and possible functional roles, are 
described in PCT Publication WO96/34099. The present invention is further based, in 
part, upon the discovery of a group of proteins which interact with the presenilins 
under physiological conditions and which, therefore, are believed to be involved in the 
biochemical pathways which are altered in Alzheimer's Disease. These proteins are 
referred to herein as presenilin-interacting (PS-interacting) proteins. Because 
mutations in the presenilins are known to be causative of Alzheimer's Disease, each of 
the PS-interacting geiies and proteins disclosed and described herein presents a novel 
target for therapeutic intervention in Alzheimer's Disease. That is, modulation of the 
interactions of these proteins with the presenilins, or modulation of the interactions of 
at least the PS-interacting domains of these PS-interacting proteins with at least the 
interacting domains of the presenilins. provides a means of modulating the activity 
and/or availability of the presenilins, or of modulating the activity and/or availability 
of the PS-interacting proteins. Furthermore, as aberrations in the interactions of 
mutant presenilins with one or more of these PS-interacting proteins is causative of 
Alzheimer's Disease, mutations in one or more of these PS-interacting proteins are 
also likely to be causative of Alzheimer's Disease. Therefore, each of the PS- 
interacting genes and proteins disclosed and described herein presents a novel target 
for diagnosis of forms of familial and/or sporadic Alzheimer's Disease with an 
etiology independent of mutations in the presenilins. Finally, as described more fully 
below, the PS-interacting genes and proteins described and disclosed herein provide 
for new assays for compounds which affect the interactions of the presenilins and PS- 
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interacting proteins, assays for other members of the biochemical pathways involved 
in the etiology of Alzheimer's Disease, and new cell lines and transgenic animal 
models for use in such assays. 

5 1. Presenilin Processing 

Employing the antibodies and protein-binding assays described and/or 
enabled in PCT Publication WO96/34099, the processing and protein-protein 
interactions of both normal and mutant presenilins were investigated. It was found 
that mutations in the presenilins appear to lead to changes in both their intracellular 
10 processing (e.g., endoproteolytic cleavage, ubiquitination, and clearance) and their 
intracellular interactions with other proteins expressed in human brain. As described 
below, knowledge of presenilin processing and interactions, and particularly changes 
in mutant presenilin processing and interactions, provides for new diagnostic and 
therapeutic targets for Alzheimer's Disease and related disorders. 

Western blot analysis suggests that the normal presenilins undergo 
proteolytic cleavage to yield characteristic N- and C-terminal fragments. As noted 
above, the normal presenilin proteins have an expected molecular mass of 47-5 1 kDa 
depending, in part, upon mRNA splice variations, electrophoretic conditions, etc. 
Analysis of Western blots suggests, however, that the normal presenilin proteins 
undergo proteolytic cleavage to yield an approximately 35 kDa N-terminal fragment 
and an ^proximately 18 kDa C-terminal fragment. In particular, Western blots 
bearing lysates from wild-type native human fibroblasts, human neocortical brain 
tissue from control subjects, and neocortical brain tissue from non-transgenic and PSl 
transgenic mice using antibodies ("14.2") recognizing PSl -specific residues 1-25 at 
the N-terminus reveal the presence of a strong immunoreactive band of approximately 
35 kDa and, after longer exposures, a weaker band of approximately 45 kDa which 
presumably represents the full-length PSl protein. Antibodies ("520") directed at 
residues 304-318 at the apex of the TM6-f7 loop of PSl, and antibodies ("4627") 
directed at residues 457-467 in the C-terminus of PSl, both recognize the same strong 
band of approximately 18 kDa. Antibodies 520 also recognize a weak band of 45 kDa 
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coincident with the PSI band detected by 14.2. Sequencing of the major C-tenninal 
fragment from PSl-transfected human embryonic kidney cells (HEK 293) showed 
that the principal endoproteolytic cleavage occurs near M298 in the proximal portion 
of the TM6-*7 loop, possibly by enzymes other than the proteasome. These 
5 observations suggest that an endoproteolytic cleavage event occurs near the junction 
ofexons9and lOofPSI. FulllengthPSl in these cells is quickly turned over (t,,< 
60 rain.) by the proteasome. 

To determine whether mutations in the presenilin proteins result in 
alterations of their proteolytic cleavage. Western blots containing lysates of fibroblast 
10 and neocortical brain homogenates from normal subjects and subjects carrying PSI 
mutations were investigated with the PS 1 specific antibody Ab 14.2. In fibroblasts, 
there were no obvious differences in the relative imensities of the protein bands when 
lysates. from heterozygous carriers of the PSI mutations were compared with normal 
homozygotes. In contrast, there appeared to be a difference between PS 1 mutation 
15 carriers and nonnals in homogenates of temporal neocortex from AD affected 

heterozygous earners of either the PSI A246E or C4I0Y mutations (which are located 
in TM6 and TM7 respectively). In heterozygotes. a strongly immunoreactive band of 
approximately 45 kDa was detected which initially appeared to correspond to the full- 
length PSI protein. Further analysis, however, revealed that this band represents an 
20 alternatively processed presenilin product. A similar band corresponding to this 
mutant processed PS 1 was observed in neocortical homogenates from some sporadic 
late-onset AD patients. These data suggest that (1) some pathogenic PSI mutations 
associated with early-onset AD alter the way in which the presenilins are processed 
through endoproteolytic and proteasome pathways and (2) the presenilin proteins, and 
25 changes in the processing of the presenilins in the brain, are also implicated in late- 
onset and sporadic AD. 

2- Presenilin- Interactrnp Proteins 

In order to identify proteins which may bind to or otherwise interact with 
30 the presenilins invivo, a yeast two-hybrid system was used as described below 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



-14- 



(Example 1). In particular, because mutations in the TM6->7 loop domains are 
known to be causative of AD, a yeast two-hybrid system was used to identify cellular 
proteins which may interact with normal and mutant presenilin TM6-^7 loop 
domains. Yeast two-hybrid studies were also done with cDNAs corresponding to the 
5 C-terminal 18 kDa endoproteolytic cleavage fragment, and with cDNAs 

corresponding to the TMl-^2 intralimiinal loop domain, which is also the site of the 
FAD associated Yl 15H missense mutation. In brief, cDNA sequences encoding the 
TM6->7 loop (i.e., residues 266 to 409 of PSl) were ligated in-frame to the GAL4 
DNA-binding domain in the pAS2-l yeast expression plasmid vector (Clontech). 

10 This plasmid was then co-transformed into S. cerevisiae strain Y190 together with a 
library of human brain cDNAs ligated into the pACT2 yeast expression vector bearing 
the GAL4 activation domain (Clontech). After appropriate selection and re-screening, 
a number of clones were recovered and sequenced bearing human brain cDNAs 
encoding peptides which interacted with the normal presenilin TM6-f7 domain. To 

15 determine whether these presenilin interactions would be modified by AD related 
mutations within the TM6->7 loop, the yeast two-hybrid system was again used with 
TM6->7 loop peptides containing the L286V, the L392V, and the exon 10 splicing 
mutants. When these mutant constructs were used as "bait" to re-screen the brain 
cDNA:GAL4 activation domain library, some but not all of the brain cDNA 

20 sequences which interacted with the normal presenilin were recovered. In addition, 
several new clones were identified which interacted with the mutant but not the 
normal presenilins. The clones corresponding to the PS-interacting proteins with the 
highest presenilin affinity are described in Example 1 and below. 

PS-interacting proteins, particularly those which interact selectively with 

25 either the normal or mutant presenilins, provide new targets for the identification of 
useful pharmaceuticals, new targets for diagnostic tools in the identification of 
individuals at risk, new sequences for the production of transformed cell lines and 
transgenic animal models, and new bases for therapeutic intervention in Alzheimer's 
Disease. In particular, the onset of AD may be associated with aberrant interactions 

30 between mutant presenilin proteins and normal forms of PS-interacting proteins such 
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as those identified using the methods described herein. These changes may increase 
or decrease interactions present with normal PSl or may cause interaction with a 
novel mutation-specific PS-interacting protein. In addition, however, aberrant 
interactions may result from nonnal presenilins binding to mutant foms of the PS- 
5 interacting proteins and, therefore, mutations in the PS-interacting proteins may also 
be causative of AD. 

A. The S5a Su hunit of the 26S Proteasnmp. 

Two overlapping clones have been identified as representing a portion of 
the human protein alternatively known as Antisecretory Factor ("ASF") or the 

10 Multiubiquitin chain-binding S5a subunit of the 26S proteasome ("S5a"). These 
clones, which together include residues 70-377 of S5a, were shown to interact with 
the normal presenilin TM6->7 loop domain but only weakly with two TM6->.7 loop 
domain mutants tested (L286V. L392V). The PS 1 :S5a interaction was confirmed by 
co-immunoprecipitation smdies, and immunocytochemical studies showed S5a and 

15 PS 1 are expressed in contiguous intracellular compartments in brain cells typically 
affected by AD. 

The interaction between PSl and the proteasome could be relevant to the 
pathogenesis of Alzheimer's Disease (AD) through several possible mechanisms. 
First, most mammalian cells seem to maintain very low levels of the PSl holoprotein. 
A notable exception to this are cells expressing the PS I A290-3 1 9 splicing mutation, 
which results in a mutant PSl holoprotein which is not endoproteolytically cleaved 
and which is. therefore, readily detectable. In the case of the A290-319 splicing 
mutation at least, the presence of the mutant PSl holoprotein. or the absence or 
reduction in the 35 kDa N-terminal and 18 kDa C-terminal fragments, appears 
sufficiem to cause AD. It is possible, therefore, that even veiy subtle changes in the 
turnover of the mutant PSl holoprotein might have significant pathophysiological 
effects. Thus, mutations in either the presenilins or S5a which perturb the PSl :S5a 
interaction in the mammalian CNS may cause the presenilin holoprotein to be 
aberrantly processed and cause AD. Therefore, modulation of presenilin proteolytic 
30 pathways might be applied therapeutically to enhance removal of mutant holoprotein. 

SUBSTITUTE SHEET (RULE 26) 



20 



25 



wo 97/27296 PCT/CA97/00051 



-16- 



To assess a potential in vivo relationship between PS 1 and the S5a subunit 
of the 26S proteasome, the effects of proteasome inhibitors on PSl metabolism were 
investigated. Short term organotypic cultures of neonatal rat hippocampus and 
carcinoma of colon (CaCo2) cells (which express high levels of both PSl and PS2) 
5 were administered either the specific, reversible proteasome inhibitor N-acetyl- 

leucinyl-leucinyl-norleucinyl-H (LLnL) (Rock et al., 1994), or the specific irreversible 
proteasome inhibitor lactacystin (Fenteany et al., 1995). Both agents caused an 
increase in the steady state levels of PSl holoprotein. Both agents also prolonged the 
half-life of the PSl holoprotein in pulse chase experiments in hippocampal slices from 

10 - 1 5 minutes to -35 minutes. As noted above, the PSl holoprotein appears to be 
rapidly turned over in normal cells. However, even after four hours of metabolic 
labelling, neither of the proteasome inhibitors affected the level of the 35 kDa N- 
terminal PSl fragment, or resulted in the appearance of novel species. These studies 
imply that the majority of the PSl holoprotein is catabolized directly via a rapid, 

15 proteasome dependent pathway in a maimer similar to several other integral 

membrane proteins (e.g. Sec61 and CFTR). On the other hand, because the -35 kDa 
and - 1 8 kDa terminal fragments are still produced in the presence of proteasome 
inhibitors, this endoproteolytic cleavage of PSl is probably not mediated by the 
proteasome pathway. Therefore, it appears that at least two proteolytic pathways act 

20 upon the PS 1 holoprotein. 

An alternate possibility is that mutant PS 1 :S5a interactions may modify 
the function or the cellular regulation of S5a. To address this possibility, S5a levels 
were examined by Western blotting of lysates from postmortem temporal neocortex 
from non-AD neurologic controls (n = 8), sporadic AD (n = 8) and PSl -linked FAD 

25 (n = 4). In the majority of non-AD brains, polyclonal anti-S5a antibodies specifically 
detected an S5a species with Mr of ~ 50 kDa, which could be abolished by 
preabsorption of the antibody with recombinant His6-S5a or with extracts ofmvc-SSa 
transfected cells. In a subset of these control cases an additional S5a reactive band 
was observed at -34 kDa. In contrast, in tissue from all subjects with sporadic late 

30 onset AD, the predominant S5a reactive species was observed at - 40 kDa which was 
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not seen in control tissue. The origin, and the functional significance of this aUered 
electrophoretic mobility is unclear but indicates that S5a processing is altered in AD 
brains, irrespective of whether the AD is presenilin-linked or sporadic. 

Thus, the presenilin-proteasome interaction appears significant in several 
respects. First, the facts that the normal presenilin TM6-J.7 loop domain interacts 
with the S5a protein, that the mutant presenilin TM6->7 loop domains fail to interact 
(or interact very weakly) with the S5a protein, that presenilins bearing mutations in 
the TM6^7 loop domain appear to be differently cleaved and multiubiquitinated, that 
proteasomes are known to be involved in the cleavage and clearance of a variety of 
proteins (particularly multiubiquitinated proteins), that inhibition of proteasome 
activity inhibits cleavage of the presenilin holopioteins, and that S5a processing is 
altered in AD brains, all suggest (1) that the S5a subunit and the 26S proteasome are 
involved in the normal processing of the presenilins and that mutations which disrupt 
this normal interaction may be responsible for the abnormal processing observed in 
TM6->7 loop domain mutants; or (2) that the presenilin-proteasome interaction may 
modulate the activity of PSl, S5a, or both, with or without involving proteasome- 
mediated presenilin processing; or (3) that modulation of the normal qualitj; control 
function of proteasome-mediated degradation of misfolded or mutant membrane 
proteins trafficking through the ER and Golgi (such as APP, Notch, or Prion proteins), 
and of misfolded, mutant, or ubiquitinated cytoplasmic proteins (including structural 
proteins such as tau, and short lived, proteasome processed signaling molecules such 
as NFkB). Thus, defective proteasome function might selectively cause these proteins 
(especially PAPP, tau. Prion) to be aberrantly metabolized. The latter would lead to ' 
the accumulation of neurotoxic, amyloidogenic protease-resistant derivatives such as 
AP and PrPsc, the accumulation of neurofibrillary tangles, and defective intracellular 
signaling fimctions. In support of these hypotheses, it should be noted that failure to 
clear hyperubiquitinated phosphorylated tau and other microtubule associated proteins 
is a prominent feature of Alzheimer's Disease (Kosik and Greenberg, 1994). 
suggesting a possible link between TM6->7 loop domain mutants, presenilin- 
proteasome interactions, tau-proteasome interactions, and the neurofibrillary tangles 
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of tau protein in AD brains. Finally, proteasomes are known to be capable of 
degrading APP and of binding the Ap peptides which are associated with Alzheimer^s 
Disease, suggesting a possible link between TM6->7 loop domain mutants, 
presenilin-proteasome interactions, APP-proteasome interactions, and the amyloid 
5 plaques characteristic of AD brains. Furthermore, administration of proteasome 
inhibitors such as LLnL and Lactacystin cause severe disturbances in PAPP 
metabolism with increases in intracellular immature N-glycosylated pAPP, and the 
secretion of much larger amounts of AP42 isoforms into the media (Klafki, et al., 
1996). 

10 Therefore, presenilin processing and the presenilin-proteasome interaction 

are clear targets for the diagnosis as well as therapeutic intervention in AD. Thus, as 
described below, assays may now be provided for drugs which affect the proteasome- 
mediated cleavage of the presenilins, which affect the alternative endoproteolytic 
cleavage and ubiquitination of the mutant presenilins, or which otherwise affect the 

15 processing and trafficking of the presenilins or the S5a subunit of the proteasome. In 
addition, as mutations in the 26S proteasome which disrupt the nonnal processing of 
the presenilins are likely to be causative of Alzheimer's Disease, additional diagnostic 
assays are provided for detecting mutations in the S5a or other subimits of the 
proteasome. Finally, additional transformed cell lines and transgenic models may 

20 now be provided which have been altered by the introduction of a normal or mutant 
sequence encoding at least a functional domain of the proteasome. The appearance of 
abnormal electrophoretic forms of S5a (and/or other proteasome subunits) in biologic 
tissues and fluids can be used as a clinical test for diagnosis and monitoring of disease 
activity in subjects with sporadic forms of AD. 

25 B. GT24: A Protein with "Armadillo" Repeats 

Another PS-interacting protein, designated GT24, was identified from 
several over-lapping clones obtained using a PS 1 266-409 domain as bait in the yeast two- 
hybrid system and a human adult brain cDNA library. Six longer GT24 clones of 
-3.8 kb in size were subsequently obtained by screening of conventional cDNA 

30 libraries. The open reading frame within the longest GT24 clone obtained to date 
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(Accession number U81004) suggests that GT24 is a protein of at least 1040 amino 
acids with a unique N-terminus, and considerable homology to several armadillo 
(arm) repeat proteins at its C-tenninus. Thus, for example, residues 440-862 of GT24 
(numbering from Accession number U81004) have 32-56% identity (p=1.2e-'") to 
5 residues 440-854 of murine pl20 protein (Accession number Zl 7804), and residues 
367-815 of GT24 have 26-42% identity (p=0.0017) to residues 245-465 of the 
melanogaster armadillo segment polarity protein (Accession number P18824). The 
GT24 gene maps to chromosome 5pl5 near the anonymous microsatellite marker 
D5S748 and the Cri-du-Chat syndrome locus. 

10 "y'»"'l'zationofunique 5- sequences ofGT24 to Northern blots reveals 

that the GT24 gene is expressed as a range of transcripts varying in size between -3.9 
and 5.0 kb in several regions of human brain, and in several non-neurologic tissues 
such as heart. In addition, iiLsitu hybridization studies using a 289 bp single copy 
ftagmem from the 5' end of GT24 in four month old murine brain reveal GT24 
15 transcription closely parallels that of PSl. with robust expression in dentate and 
hippocampal neurons, in scattered neocortical neurons, and in cerebellar Purkinje 
cells. In day E13 murine embryos, GT24 is widely expressed at low levels, but is 
expressed at somewhat higher levels in somites and in the neural tube. A 
physiological iiLsdvo interaction between GT24 and PS I is supported by co- 
20 immunoprecipitation studies in HEK293 cells transiently transfected with a wild type 
human PSl cDNA, a cnmyc-tagged cDNA encoding residues 484-1040 of GT24 
(including the C-terminal affi repeats), or both cDNAs. Cell lysates were 
immunoprecipitated with anti-PS 1 antibodies and then investigated for the presence of 
the myc:GT24 protein by immuno-blotting. In PSl/rnyc-GT24 double transfected 
25 cells, the immunoprecipitates contained a robust anti-myc reactive band of Mr -60 
kDa, which co-migrated with a myc-GT24 control. In cells transfected with njic- 
GT24 only, a very weak band was detected after long exposures, presumably 
reflecting interaction of the mYC-GT24 with low levels of endogenous PS 1 . No myc- 
reactive bands were detected in cells transfected with PS 1 alone, or in any of the 
30 transfected cells immunoprecipitated with pre-immune serum. Taken together, these 
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observations strongly suggest that the observed PSl :GT24 interaction is 
physiologically relevant. 

To explore whether mutations in the TM6-TM7 loop of PSl might 
influence the PSl :GT24 interaction, we employed quantitative liquid P-galactosidase 
5 assays to directly compare the yeast-two-hybrid interaction of the C- terminal residues 
499-1040 of GT24 with wildtype and mutant PSlj^^^. These studies revealed that 
the interaction of GTZ4^^_^Q^ with a L286V mutant PSl domain was not significantly 
different from the interaction with the corresponding wild type PS 1 domain. In 
contrast, there was a significant reduction in the GT244„.,o4o interaction with the 
10 L392V mutant PS 1 construct. The absence of an effect of the L286V mutation, and 
the presence of an effect with the L392V mutation, may suggest that some mutations 
may effect PSl :GT24 binding, while others may modulate the PSl response to GT24 
binding. 

The PS 1 :GT24 interaction could support several functions. The arm repeat 
15 motif of GT24 has been detected in several proteins with diverse functions including 
P-catenin and its invertebrate homologue armadillo, plakoglobin, pi 20, the 
adenomatous polyposis coli (APC) gene, suppressor of RNA polymerase 1 in yeast 
(SRPl), and smGDS. For example, P-catenin, pi 20 and plakoglobin play an essential 
role in intercellular adhesion. B-catenin/ armadillo is involved in transduction of 
20 winglessAVnt signals during cell fate specification, and p-catenin and pi 20 may play 
a role in other receptor mediated signal transduction events including responses to 
trophic factors such as PDGF. EGF, CSF-1 and NGF. 

If the PS1:GT24 interaction is part of intercellular signaling pathways for 
trophic factors, or is involved in cell-cell adherence, disruption of the interaction may 
25 be involved in the neurodegenerative processes in PS-Unked FAD brains, and in the 
increased sensitivity of PSl or PS2 transfected cells to apoptosis (Wolozin et al., 
1996). It is of note that at least one arm protein, smGDS, stimulates GDP/GTP 
exchange on intracellular G-proteins (Kikuchi et al. 1992; Borguski et al., 1993), and 
that mutant forms of both PAPP and PS2 are thought to activate progranmied cell 
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death pathways through mechanisms involving heterotrimeric GTP/GDP proteins 
(Wolozin et al, 1996; Okaraoto, et ai., 1995; Yamatsuji, et al. 1996). 

The interaction between PSl and GT24 may also be involved in some of 
the developmental phenotypes associated with homozygous PSl knockouts in mice 
5 such as failed somitogenesis of the caudal embryo, short tail, and fatal cerebral 
hemorrhage at around day EI3.5 (Wong et al., 1996). The resemblance of these 
skeletal phenotypes to those associated with null mutations in PA2C1 and Notch, and 
the apparent suppressor effect of mutations in sell2 on Notch/linl2 mediated 
signaling in C. elegans suggest that the PS proteins fiihction in'the Notch signaling 
10 pathway. In addition, mice homozygous for a knockout of the Wnt-3a gene (Takada 
et al.. 1994). and murine homozygotes for a spontaneous mutation, "vestigial tail" or 
Yt. in the WnN3a gene (Greco et al.. 1996). have skeletal phenotypes of defective 
caudal somite and tail bud fonnation. The WQtOa knockouts are embryonic lethal by 
day 12.5. These phenotypes are similar to those of homozygous knockouts of the 
15 murine PSl gene (Wong et al., 1996). The observation that GT24 binds to PSl. is 
expressed in embtyonic somites, and contains the aimadillo repeat motif of other 
proteins used in the downstream signaling in the Win.le../Wnr pathway suggests that 
PSl is a downstream elemem in the GT24-Wingles5^ pathway. This can be 
exploited to create a bioassay for drugs affecting the GT24.PS1 interaction directly, or 
20 affecting upstream or downstream components of that interaction pathway, and can 
therefore be used to monitor the effects of presenilin mutations. For example, cells 
transfected with normal or mutant presenilins may be exposed to soluble Wnt-3a 
protein (or other Wm proteins such as Wnt-1) and assayed for changes which are 
specific to the Win^lessAVnt signaling pathway, or for any of the other changes 
25 described herein for cell assays (e.g.. intracellular ion levels. AP processing, 
apoptosis, etc.). 

Thus, the GT24 protein also presents new targets for diagnosis as well as 
therapeutic intervention in AD. For example, as mutations in the GT24 protein may 
also be causative of Alzheimer's Disease, additional diagnostic assays are provided for 
30 detecting mutations in these sequences. Similarly, additional transfomied cell lines 
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and transgenic models may now be provided which have been altered by introduction 
of a nomial or mutant nucleic acid encoding at least a functional domain of the GT24 
protein, and particularly the functional domains (e.g., residues 70-377) which interact 
with the presenilins. Such transformed cells and transgenics will have utility in assays 
5 for compounds which modulate the presemlin-GT24 interactions. 

C. P0Q71: A Protein with "Armadillo" Repeats 

Another independent clone isolated in the initial screening with the wild 
type PS 1 266^09 "bait" also encodes a peptide with C-teraiinal arm repeats (clone 
Y2H25, Accession number U81005). A longer cDNA sequence corresponding to the 

10 Y2H25 clone has been deposited with GenBank as human protein p007 1 (Accession 
number X81889) and is reproduced herein as SEQ ID NO: 5. Clone Y2H25 
corresponds essentially to nucleotide positions 1682-1994 of SEQ ID NO: 5. 
Comparison of the predicted sequence of the Y2H25/p0071 ORF vnih that of GT24 
confirms that they are related proteins with 47% overall amino acid sequence identity, 

15 and with 70% identity between residues 346-862 of GT24 and residues 509-1 022 of 
p0071 . This suggests that PSl interacts with a novel class of mn repeat containing 
proteins. The broad --4.5 kb hybridization signal obtained on Northern blots with the 
unique 5* end of GT24 could reflect either alternative splicing/polyadenylation of 
GT24 or, less likely, the existence of additional members of this family with higher 

20 degrees of N-terminal homology to GT24 than p0071 , Cells transformed with these 
sequences, or transgenic animals including these sequences, will have additional 
utility as animal models of AD and for use in screening for compounds which 
modulate the action of normal and mutant presenilins. 

D. Rab II 

25 One clone (Y2H9), disclosed herein as SEQ ID NO: 5, was identified as 

interacting with the normal PSl TM6">7 loop domain and appears to correspond to a 
known gene, Rabl 1, available through Accession numbers X56740 and X53143. 
Rab 11 is believed to be involved in protein/vesicle trafficking in the ER/Golgi. Note 
the possible relationship to processing of membrane proteins such as flAPP and Notch 
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with resultant overproduction of toxic AB peptides (especially neurotoxic AC,^^,,,, 
isoforms) (Scheuner, et al, 1995). 

E. Retinoid X Receotor-p 

One clone (Y2H23b), disclosed herein as SEQ ID NO: 6. was identified as 
interacting with the normal PSl TM6-.7 loop domain and appears to correspond to a 
icnown gene, known variously as the retinoid X receptor-p. nuclear receptor co- 
regulator, or MHC Class I regulatory element, and is available through Accession 
numbers M84820. X63522 and M81766. This gene is believed to be involved in 
intercellular signaling, suggesting a possible relationship to the intercellular signaling 
function mediated by Celegans sell2 and Notch/lin-I2 (transcription activator). 
F- Cytoplasmic Chaperonin 

One clone (Y2H27), disclosed herein as SEQ ID NO: 8, was identified as 
interacting with the nonnal PS 1 TM6->7 loop domain and appears to correspond to a 
known gene, a cytoplasmic chaperonin containing TCP-l, available through 
15 Accession numbers Ul 7104 and X74801. 

G. Clone Y2H3S 

One clone (Y2H35), disclosed herein as SEQ ID NO: 7, was identified as 
interacting with the normal PSl TM6-.7 loop domain and appears to correspond to a 
sequence that codes for a protem of unknown function, available through Accession 
number R12984, but which displays evolutionary conservation in yeast sequences. 

H. Clone Y2H 171 

One clone (Y2H171), disclosed herein as SEQ ID NO: 9, was identified as 
interacting with the noimal PSl TM6-.7 loop domain and appears to correspond to a 
known expressed repeat sequence available through Accession number D55326. 
25 I. Clone Y2H41 

y 

One clone (Y2H41) was idemified which reacts strongly with the TM6->-7 
loop domains of both PSl and PS2 as well as the mutant loop domains of PSl. The 
sequence, disclosed as SEQ ID NO: 10, shows strong homology to an EST of 
unknown function (Accession number T64843). 



20 
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III. Preferred Embodiments 

Based, in part, upon the discoveries disclosed and described herein, the 
following preferred embodiments of the present invention are provided. 

5 

1. Isolated Nucleic Acids 

In one series of embodiments, the present invention provides isolated 
nucleic acids corresponding to, or relating to, the nucleic acid sequences disclosed 
herein, which encode at least the PS-interacting domain of a PS-interacting protein. 

10 As described more fully below, the disclosed and enabled sequences include normal 
sequences from humans and other mammalian species, mutant sequences from 
humans and other mammalian species, homologous sequences from non-mammalian 
species such as Drosophila and C. elegans, subsets of these sequences useful as probes 
and PCR primers, subsets of these sequences encoding fragments of the PS-interacting 

15 proteins or corresponding to particular structural domains or polymoiphic regions, 
complementary or antisense sequences corresponding to fragments of the PS- 
interacting protein genes, sequences in which the PS-interacting protein coding 
regions have been operably joined to exogenous regulatory regions, and sequences 
encoding fusion proteins in which portions of the PS -interacting proteins are fiised to 

20 other proteins useful as markers of expression, as "tags" for purification, or in screens 
and assays for other proteins which interact with the PS-interacting proteins. 

Thus, in a first series of embodiments, isolated nucleic acid sequences are 
provided which encode at least a PS-interacting domain of a normal or mutant version 
of a PS-interacting protein. Examples of such nucleic acid sequences are disclosed 

25 herein as SEQ ID NOs: 1, 3, and 5-10. In addition, given the sequences of the PS- 
interacting domains of the PS-interacting proteins disclosed herein, one of ordinary 
skill in the art is clearly enabled to obtain the entire genomic or cDNA sequence 
encoding the entire PS-interacting proteins. Thus, for example, based upon the initial 
clone of the GT24 protein obtained using the yeast two-hybrid system (Example 1), 

30 the larger GT24 clone disclosed as SEQ ID NO: 3 was obtained by standard methods 
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known in the art. Complete cDNA or genomic clones of each of the genes encoding 
the disclosed sequences may be similarly obtained by one of oniinaiy skill in the art. 
Therefore, the present invention provides complete genomic sequences as well as 
cDNA sequences corresponding to the PS-interacting protein genes of the invention. 
Alternatively, the nucleic acids of the invention may comprise recombinant genes or 
"minigenes" in which all or some introns of the PS-interacting protein genes have 
been removed, or in which various combinations of introns and exons and local cis- 
acting regulatory elements have been engineered in propagation or expression 
constructs or vectors. For purposes of reducing the size of a recombinant PS- 
interacting protein gene, a cDNA gene may be employed, or various combinations of 
introns and untranslated exons may be removed from a DNA construct. These and 
many variations on these embodiments are now enabled by the idemification and 
description of the PS-interacting proteins provided herein. 

In addition to the disclosed PS-interacting protein and gene sequences, one 
of ordinary skill in the art is now enabled to identify and isolate nucleic acids 
representing PS-interacting genes or cDNAs which are allelic to the disclosed 
sequences or which are heterospecific homologues. Thus, the present invention 
provides isolated nucleic acids corresponding to these alleles and homologues, as well 
as the various above-described recombinant constructs derived from these sequences, 
by means which are well known in the art. Briefly, one of ordinary skill in the art 
may now screen preparations of genomic or cDNA, including samples prepared from 
individual organisms (e.g., human AD patients or their family members) as well as 
bacterial, viral, yeast or other libraries of genomic or cDNA, using probes or PCR 
primers to identify allelic or homologous sequences. Because it is desirable to 
identify mutations in the PS-interacting proteins which may contribute to the 
development of AD or other disorders, because it is desirable to idemify 
polymorphisms in the PS-interacting proteins which are not pathogenic, and because 
it is also desirable to create a variety of animal models which may be used to study 
AD and screen for potential therapeutics, it is particularly contemplated that additional 
PS-imeracting protein sequences will be isolated from other preparations or libraries 
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of human nucleic acids and from preparations or libraries from animals including rats, 
mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non-human 
primates. Furthermore, PS-interacting protein homologues from yeast or invertebrate 
species, including C. elepans and other nematodes, as well as Drosophila and other 
5 insects, may have particular utility for drug screening. 

Standard hybridization screening or PCR techniques may be employed (as 
used, for example, in the identification of the mPSl gene disclosed in PCT 
Publication WO96/34099) to identify and/or isolate such allelic and homologous 
sequences using relatively short PS-interacting protein gene sequences. The 

10 sequences may include 8 or fewer nucleotides depending upon the nature of the target 
sequences, the method employed, and the specificity required. Future technological 
developments may allow the advantageous use of even shorter sequences. With 
current technology, sequences of 9-50 nucleotides, and preferably about 18-24 are 
preferred. These sequences may be chosen from those disclosed herein, or may be 

15 derived from other allelic or heterospecific homologues enabled herein. When 
probing mRNA or screening cDNA libraries, probes and primers from coding 
sequences (rather than introns) are preferably employed, and sequences which are 
omitted in alternative splice variants typically are avoided unless it is specifically 
desired to identify those variants. Allelic variants of the PS-interacting protein genes 

20 may be expected to hybridize to the disclosed sequences under stringent hybridization 
conditions, as defined herein, whereas lower stringency may be employed to identify 
heterospecific homologues. 

In another series of embodiments, the present invention provides for 
isolated nucleic acids which include subsets of the PS-interacting protein sequences or 

25 their complements. As noted above, such sequences will have utility as probes and 
PCR primers in the identification and isolation of allelic and homologous variants of 
the PS-interacting protein genes. Subsequences corresponding to polymorphic 
regions of the PS-interacting proteins, will also have particular utility in screening 
and/or genotyping individuals for diagnostic purposes, as described below. In 

30 addition, and also as described below, such subsets will have utility for encoding (1) 
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fragments of the PS-interacting proteins for inclusion in fusion proteins, (2) fragments 
which comprise functional domains of the PS-interacting proteins for use in binding 
studies, (3) fragments of the PS-interacting proteins which may be used as 
immunogens to raise antibodies against the PS-interacting proteins, and (4) fragments 
5 of the PS-interacting proteins which may act as competitive inhibitors or as mimetics 
of the PS-interacting proteins to inhibit or mimic their physiological fimctions. 
Finally, such subsets may encode or represent complementary or antisense sequences 
which can hybridize to the PS-interacting protein genes or PS-interacting protein 
niRNA transcripts under physiological conditions to inhibit the transcription or 
10 translation of those sequences. Therefore, depending upon the intended use, the 
present invention provides nucleic acid subsequences of the PS-interacting protein 
genes which may have lengths varying from 8-10 nucleotides (e.g., for use as PCR 
primers) to nearly the fiill size of the PS-interacting protein genomic or cDNAs. 
Thus, the present invention provides isolated nucleic acids comprising sequences 
15 corresponding to at least 8-10, preferably 15, and more preferably at least 20 

consecutive nucleotides of the PS-interacting protein genes, as disclosed or otherwise 
enabled herein, or to their complements. As noted above, however, shorter sequences 
may be useful with different technologies. 

In another series of embodiments, the present invention provides nucleic 
acids in which the coding sequences for the PS-interacting proteins, with or without 
introns or recombinantly engineered as described above, are operably joined to 
endogenous or exogenous 5' and/or 3' regulatory regions. Using the present disclosure 
and standard genetic techniques (e.g., PCR extensions, targeting gene walking), one of 
ordinary skill in the art is now enabled to clone the 5' and/or 3" endogenous regulatory 
regions of any of the disclosed PS-interacting protein genes. Similarly, allelic 
variants of these endogenous reguJatoiy regions, as well as endogenous regulatory 
regions from other mammalian homologues, are similarly enabled without undue 
experimentation. Alternatively, exogenous regulatory regions (i.e., regulatory regions 
from a different conspecific gene or a heterospecific regulatory region) may be 
operably joined to the PS-interacting protein coding sequences in order to drive 
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expression. Appropriate 5' regulatory regions will include promoter elements and 
may also include additional elements such as operator or enhancer sequences, 
ribosome binding sequences, RNA capping sequences, and the like. The regulatory 
region may be selected from sequences that control the expression of genes of 
5 prokaryotic or eukaryotic cells, their viruses, and combinations thereof Such 

regulatory regions include, but are not limited to, the lac system, the tip system, the 
tac system, and the trc system; major operator and promoter regions of phage X; the 
control region of the fd coat protein; early and late promoters of SV40; promoters 
derived from polyoma, adenovirus, retrovirus, baculovirus, and simian virus; 3- 
10 phosphoglycerate kinase promoter; yeast acid phosphatase promoters; yeast alpha- 
mating factors; promoter elements of other eukaryotic genes expressed in neurons or 
other cell types; and combinations thereof. In particular, regulatory elements may be 
chosen which are inducible or repressible (e.g., the p-galactosidase promoter) to allow 
for controlled and/or manipulable expression of the PS-interacting protein genes in 
15 cells transformed with these nucleic acids. Altematively, the PS-interacting protein 
coding regions may be operably joined with regulatory elements which provide for 
tissue specific expression in multicellular organisms. Such constructs are particularly 
useful for the production of transgenic organisms to cause expression of the PS- 
interacting protein genes only in appropriate tissues. The choice of appropriate 
20 regulatory regions is within the ability and discretion of one of ordinary skill in the art 
and the recombinant use of many such regulatory regions is now established in the ait. 

In another series of embodiments, the present invention provides for 
isolated nucleic acids encoding all or a portion of the PS-interacting proteins in the 
form of a fusion protein. In these embodiments, a nucleic acid regulatory region 
25 (endogenous or exogenous) is operably joined to a first coding region which is 
CO valently joined in-frame to a second coding region. The second coding region 
optionally may be covalently joined to one or more additional coding regions and the 
last coding region is joined to a termination codon and, optionally, appropriate 3* 
regulatory regions (e.g., polyadenylation signals). The PS-interacting protein 
30 sequences of the fusion protein may represent the first, second, or any additional 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



-29- 



10 



25 



30 



coding regions. The PS-interacting protein sequences may be conserved or non- 
conserved domains and can be placed in any coding region of the fusion. The non- 
PS-interacting protein sequences of the fusion may be chosen according to the needs 
and discretion of the practitioner and are not limited by the present invention. Useful 
non-PS-interacting protein sequences include, for example, short sequence "tags" such 
as antigenic determinants or poly-His tags which may be used to aid in the 
identification or purification of the resultant fusion protein. Alternatively, the non- 
PS-interacting protein coding region may encode a large protein or protein fragment, 
such as an enzyme or binding protein which also may assist in the identification and 
purification of the protein, or which may be useful in an assay such as those described 
below. Particularly contemplated fusion proteins include poly-His and GST 
(glutathione S-transferase) fiisions which are useful in isolating and purifying the 
presenilins-imeracting proteins, and the yeast two hybrid fusions, described below, 
which are useful in assays to identify other proteins which bind to or interact witii the 
15 PS-interacting proteins. 

In another series of embodiments, tiie present invention provides isolated 
nucleic acids in the form of recombinant DNA constructs in which a marker or 
reporter gene (e.g., P-galactosidase, luciferase) is operably joined to the 5- regulatory 
region of a PS-interacting protein gene such that expression of the marker gene is 
under the control of those regulatory sequences. Using the PS-interacting protein 
regulatory regions enabled herein, including regulatory regions from human and otiier 
mammalian species, one of ordinary skill in the art is now enabled to produce such 
constructs. As discussed more fully below, such isolated nucleic acids may be used to 
produce cells, cell lines or transgenic animals which are useful in the identification of 
compounds which can, directiy or indirectiy. differentially affect the expression of the 
PS-interacting proteins. 

Finally, the isolated nucleic acids of the present invention include any of 
tiie above described sequences when included in vectors. Appropriate vectors include 
cloning vectors and expression vectors of all types, including plasmids, phagemids, 
cosmids, episomes, and the like, as well as integration vectors. The vectors may also 
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include various marker genes (e.g., antibiotic resistance or susceptibility genes) which 
are useful in identifying cells successfully transfonmed therewith. In addition, the 
vectors may include regulatory sequences to which the nucleic acids of the invention 
are operably joined, and/or may also include coding regions such that the nucleic 
5 acids of the invention, when appropriately ligated into the vector, are expressed as 
fusion proteins. Such vectors may also include vectors for use in yeast "two hybrid," 
baculovirus, and phage-display systems. The vectors may be chosen to be useful for 
prokaryotic, eukaryotic or viral expression, as needed or desired for the particular 
application. For example, vaccinia virus vectors or simian virus vectors with the 

10 S V40 promoter (e.g., pSV2), or Herpes simplex virus or adeno-associated virus may 
be useful for transfection of mammalian cells including neurons in culture or in vivo , 
and the baculovirus vectors may be used in transfecting insect cells (e.g., butterfly 
cells). A great variety of different vectors are now commercially available and 
otherwise known in the art, and the choice of an ^propriate vector is within the 

15 ability and discretion of one of ordinary skill in the art. 

2. Substantially Pure Proteins 

The present invention provides for substantially pure preparations of the 
PS-interacting proteins, fragments of the PS-interacting proteins, and fusion proteins 

20 including the PS-interacting proteins or fragments thereof The proteins, fragments 
and fusions have utility, as described herein, in the generation of antibodies to normal 
and mutant PS-interacting proteins, in the identification of proteins (aside from the 
presenilins) which bind to the PS-interacting proteins, and in diagnostic and 
therapeutic methods. Therefore, depending upon the intended use, the present 

25 invention provides substantially pure proteins or peptides comprising amino acid 

sequences which are subsequences of the complete PS-interacting proteins and which 
may have lengths varying from 4-10 amino acids (e.g., for use as immunogens), or 10- 
100 amino acids (e.g., for use in binding assays), to the complete PS-interacting 
proteins. Thus, the present invention provides substantially pure proteins or peptides 

30 comprising sequences corresponding to at least 4-5, preferably 6-10, and more 
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preferably at least 50 or 100 consecutive amino acids of the PS-interacting proteins, as 
disclosed or otherwise enabled herein. 

The proteins or peptides of the invention may be isolated and purified by 
any of a variety of methods selected on the basis of the properties revealed by their 
protein sequences. For example, the PS-interacting proteins may be isolated from 
cells in which the PS-interacting protein is normally highly expressed. Alternatively 
the PS-interacting protein, fusion protein, or fragment thereof, may be purified from 
cells transformed or transfected with expression vectors (e.g., baculovirus systems 
such as the pPbac and pMbac vectors (Stratagene, La JoUa, CA); yeast expression 
systems such as the pYESHIS Xpress vectors (Invitrogen. San Diego, CA); eukaryotic 
expression systems such as pcDNAS (Invitrogen, San Diego, CA) which has constant 
constitutive expression, or LacSwitch (Stratagene. La Jolla, CA) which is inducible; 
or prokaryotic expression vectors such as pKK233-3 (Clontech. Palo Alto. CA). In 
the event that the protein or fragment integrates into the endoplasmic reticulum or 
plasma membrane of the recombinant cells (e.g.. eukaryotic cells), the protein may be 
purified from the membrane fraction. Alternatively, if the protein aggregates in 
inclusion bodies within the recombinant cells (e.g.. prokaryotic cells), the protein may 
be purified from whole lysed cells or from solubilized inclusion bodies. 

Purification can be achieved using standard protein purification procedures 
including, but not limited to. gel-filtration chromatography, ion-exchange 
chromatography, high-performance liquid chromatography (RP-HPLC. ion-exchange 
HPLC, size-exclusion HPLC, high-perfomiance chromatofocusing chromatography, 
hydrophobic interaction chromatography, immunoprecipitation, or immunoaffmity 
purification. Gel electrophoresis (e.g., PAGE, SDS-PAGE) can also be used to isolate 
a protein or peptide based on its molecular weight, charge properties and 
hydrophobicity. 

A PS-interacting protein, or a fragment thereof, may also be conveniently 
purified by creating a fiision protein including the desired PS-interacting protein 
sequence fiised to another peptide such as an antigenic determinant or poly-His tag 
(e.g.. QIAexpress vectors, QIAGEN Corp., Chatsworth, CA), or a larger protein (e.g.. 
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GST using the pGEX-27 vector (Amrad, USA) or green fluorescent protein using the 
Green Lantern vector (GIBCO/BRL. Gaithersburg, MD). The fusion protein may be 
expressed and recovered from prokaryotic or eukaryotic cells and purified by any 
standard method based upon the fusion vector sequence. For example, the fusion 
5 protein may be purified by immunoaffinity or immunoprecipitation with an antibody 
to the non-PS-interacting protein portion of the fusion or, in the case of a poly-His tag, 
by affinity binding to a nickel column. The desired PS-interacting protein or fragment 
may then be further purified from the fusion protein by enzymatic cleavage of the 
fusion protein. Methods for preparing and using such fusion constructs for the 
10 pimfication of proteins are well known in the art and several kits are commercially 
available for this purpose. In light of the present disclosure, one is now enabled to 
employ such fusion constructs with the PS-interacting proteins. 

3. Antibodies to the PS-interacting Proteins 

15 The present invention also provides antibodies, and methods of making 

antibodies, which selectively bind to the PS-interacting proteins or fragments thereof 
Of particular importance, by identifying the PS-interacting domains of the PS- 
interacting proteins, and methods of identifying mutant forms of the PS-interacting 
proteins associated with Alzheimer's Disease, the present invention provides 

20 antibodies, and methods of making antibodies, which will selectively bind to and, 
thereby, identify and/or distinguish normal and mutant (i.e., pathogenic) forms of the 
PS-interacting proteins. The antibodies of the invention have utility as laboratory 
reagents for, inter alia , immunoaffinity purification of the PS-interacting proteins. 
Western blotting to identify cells or tissues expressing the PS-interacting proteins, and 

25 immunocytochemistry or immunofluorescence techniques to establish the subcellular 
location of the proteins. In addition, as described below, the antibodies of the 
invention may be used as diagnostics tools to identify carriers of AD-related PS- 
interacting protein alleles, or as therapeutic tools to selectively bind and inhibit 
pathogenic forms of the PS-interacting proteins in vivo . 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



•33- 



10 



15 



30 



The antibodies of the invention may be generated using the entire PS- 
interacting proteins of the invention, or using any PS-interacting protein epitope 
which is characteristic of that protein and which substantially distinguishes it from 
other host proteins. Any method of choosing antigenic detenninants known in the art 
may, of course, be employed. Such epitopes may be identified by comparing 
sequences of, for example, 4-10 amino acid residues from a PS-interacting protein 
sequence to computer databases of protein sequences from the relevant host. In 
addition, larger fragments (e.g., 8-20 or, preferably, 9-15 residues) including one or 
more potential epitopes may also be employed. Antibodies to the PS-interacting 
domains (identified by the yeast two-hybrid assays described below) are expected to 
have the greatest utility both diagnosUcally and therapeutically. On the other hand, 
antibodies against highly conserved domains are expected to have the greatest utility 
for purification or identification of PS-interacting proteins. 

PS-interacting protein immunogen preparations may be produced from 
crude extracts (e.g., lysates or membrane fractions of cells highly expressing the 
proteins), from proteins or peptides substantially purified from cells which naturally 
or recombinantly express them or, for short immunogens, by chemical peptide 
synthesis. The immunogens may also be in the forni of a fusion protein in which the 
non-PS-interacting protein region is chosen for its adjuvant properties. As used 
20 herein, a PS-interacting protein immunogen shall be defined as a preparation 

including a peptide comprising at least 4-8, and preferably at least 9-15 consecutive 
amino acid residues of a PS-interacting proteins, as disclosed or otherwise enabled 
herein. Sequences of fewer residues may, of course, also have utility depending upon 
the intended use and fiiture technological developments. Therefore, any PS- 
interacting protein derived sequences which are employed to generate antibodies to 
the PS-interacting proteins should be regarded as PS-imeracting protein immunogens. 

The antibodies of the invention may be polyclonal or monoclonal, or may 
be antibody fragments, including Fab fragments. F(ab%. and single chain antibody 
fragments. In addition, after identifying usefiil antibodies by the method of the 
invention, recombinant antibodies may be generated, including any of the antibody 
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fragments listed above, as well as humanized antibodies based upon non-human 
antibodies to the PS-interacting proteins. In light of the present disclosure, as well as 
the characterization of other PS-interacting proteins enabled herein, one of ordinary 
skill in the art may produce the above-described antibodies by any of a variety of 
5 standard means well known in the art. For an overview of antibody techniques, see 
Antibody Engineerinp: A Practical Guide . Borrebaek, ed., W.H. Freeman & 
Company, NY (1992), or Antibody Engineering . 2nd Ed., Borrebaek, ed.. Oxford 
University Press, Oxford (1995). 

As a general matter, polyclonal antibodies may be generated by first 

10 immunizing a mouse, rabbit, goat or other suitable animal with the PS-interacting 
protein immunogen in a suitable carrier. To increase the immunogenicity of the 
preparation, the immunogen may be coupled to a carrier protein or mixed with an 
adjuvant (e.g., Freund's adjuvant). Booster injections, although not necessary are 
reconmiended. After an appropriate period to allow for the development of a humoral 

15 response, preferably several weeks, the animals may be bled and the sera may be 
purified to isolate the immunoglobulin component. 

Similarly, as a general matter, monoclonal anti-PS-interacting protein 
antibodies may be produced by first injecting a mouse, rabbit, goat or other suitable 
animal with a PS-interacting protein immimogen in a suitable carrier. As above, 

20 carrier proteins or adjuvants may be utilized and booster injections (e.g., bi- or tri- 
weekly over 8-10 weeks) are recommended. After allowing for development of a 
humoral response, the animals are sacrificed and their spleens are removed and 
resuspended in, for example, phosphate buffered saline (PBS). The spleen cells serve 
as a source of lymphocytes, some of which are producing antibody of the appropriate 

25 specificity. These cells are then fiised with an immortalized cell line (e.g., myeloma), 
and the products of the fiision are plated into a number of tissue culture wells in the 
presence of a selective agent such as HAT. The wells are serially screened and 
replated, each time selecting cells making useful antibody. Typically, several 
screening and replating procedures are carried out until over 90% of the wells contain 

30 single clones which are positive for antibody production. Monoclonal antibodies 
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produced by such clones may be purified by standard methods such as affinity 
chromatography using Protein A Sepharose. by ion-exchange chromatography, or by 
variations and combinations of these techniques. 

The antibodies of the invention may be labelled or conjugated with other 
5 compounds or materials for diagnostic and/or therapeutic uses. For example, they 
may be coupled to radionuclides, fluorescem compounds, or enzymes for imaging or 
therapy, or to liposomes for the targeting of compounds contained in the liposomes to 
a specific tissue location. 

10 4. Transformed Cell Lines 

The presem invention also provides for cells or cell hnes, both prokaryotic 
and cukaryotic. which have been transfomied or transfected with the nucleic acids of 
the present invention so as to cause clonal propagation of those nucleic acids and/or 
expression of the proteins or peptides encoded thereby. Such cells or cell lines will 
have utihty both in the propagation and production of the nucleic acids and proteins of 
the present invention but also, as further described herein, as model systems for 
diagnostic and therapeutic assays. In particular, it is expected that cells co- 
transformed with PS-imeracting protein sequences as well as presenilin sequences will 
have improved utility as models of the biochemical pathways which may be affected 
in AD. For example, cells co-transformed with the interacting domains of PS- 
interacting sequences and presenilins in yeast two-hybrid fusion constructs, will have 
utility in screening for compounds which either enhance or inhibit interactions 
between these domains. Similarly, for cells transformed with a heterospecific 
presenilin, co-transformation with a similarly heterospecific PS-imeracting protein, or 
co-transformation and homologous recombination to introduce a similarly 
heterospecific PS-interacting domain of a PS-interacting protein (e.g., "humanizing" a 
non-human endogenous PS-interacting protein), will result in a better model system 
for studying the interactions of the presenilins and the PS-interacting proteins. Cells 
transfonned with only PS-interacting sequences will, of course, have utility of their 
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own for studying the role of these proteins in the etiology of AD, and also as 
precursors for presenilin co-transformed cells. 

As used herein, the term "transfomied cell" is intended to embrace any 
cell, or the descendant of any cell, into which has been introduced any of the nucleic 
5 acids of the invention, whether by transformation, transfection, infection, or other 
means. Methods of producing appropriate vectors, transforming cells with those 
vectors, and identifying transformants are well known in the art and are only briefly 
reviewed here (see, for example, Sambrook et al. (1989) Molecular Cloning: A 
Laboratory Manual. 2nd ed.. Cold Spring Harbor Laboratory Press, Cold Spring 

10 Harbor, New York). 

Prokaryotic cells useful for producing the transformed cells of the 
invention include members of the bacterial genera Escherichia (e.g., E. coli ). 
Pseudomonas (e.g., P. aerueinosaV and Bacillus (e.g., B. subtillus . ^ 
stearothemophilus) . as well as many others well known and frequently used in the 

15 art. Prokaryotic cells are particularly useful for the production of large quantities of 
the proteins or peptides of the invention (e.g., normal or mutant PS-interacting 
proteins, fragments of the PS-interacting proteins, fusion proteins of the PS- 
interacting proteins). Bacterial cells (e.g., E. coli) may be used with a variety of 
expression vector systems including, for example, plasmids with the T7 RNA 

20 polymerase/promoter system, bacteriophage X regulatory sequences, or Ml 3 Phage 
mGPI-2. Bacterial hosts may also be transformed with fusion protein vectors which 
create, for example, lacZ, trpE, maltose-binding protein, poly-His tags, or glutathione- 
s-transferase fusion proteins. All of these, as well as many other prokaryotic 
expression systems, are well known in the art and widely available commercially 

25 (e.g., pGEX-27 (Amrad, USA) for GST fusions). 

Eukaryotic cells and cell lines useful for producing the transformed cells of 
the invention include manunalian cells and cell lines (e.g., PC 12, COS, CHO, 
fibroblasts, myelomas, neuroblastomas, hybridomas, himian embryonic kidney 293, 
oocytes, embryonic stem cells), insect cells lines (e.g., using baculovirus vectors such 

30 as pPbac or pMbac (Stratagene, La Jolla, CA)), yeast (e.g., using yeast expression 
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vectors such as pYESHlS (Invitrogen, CA)), and fungi. Eukaiyotic cells are 
particularly useful for embodiments in which it is necessary that the PS-interacting 
proteins, or functional fragments thereof, perform the functions and/or undergo the 
intracellular interactions associated with either the normal or mutant proteins. Thus, 
5 for example, transformed eukaiyotic cells are preferred for use as models of PS- 
interacting protein function or interaction, and assays for screening candidate 
therapeutics preferably employ transformed eukaiyotic cells. 

To accompUsh expression in eukaryotic cells, a wide variety of vectors 
have been developed and are commercially available which allow inducible (e.g.. 
10 LacSwitch expression vectors, Stratagene. La Jolla, CA) or cognate (e.g., pcDNAS 
vectors. Invitrogen. Chatsworth, CA) expression of PS-interacting protein nucleotide 
sequences under the regulation of an artificial promoter element. Such promoter 
elements are often derived from CMV or SV40 viral genes, although other strong 
promoter elements which are active in eukaiyotic cells can also be employed to induce 
15 transcription of PS-interacting protein nucleotide sequences. Typically, these vectors 
also contain an artificial polyadenylation sequence and 3* UTR which can also be 
derived from exogenous viral gene sequences or from other eukaryotic genes. 
Furthermore, in some constructs, artificial, non-coding, spliceable introns and exons 
are included in the vector to enhance expression of the nucleotide sequence of interest. 
These expression systems are commonly available fixjm commercial sources and are 
typified by vectors such as pcDNA3 and pZeoSV (Invitrogen. San Diego. CA). 
Imiumerable commercially-available as well as custom-designed expression vectors 
are available from commercial sources to allow expression of any desired PS- 
interacting protein transcript in more or less any desired cell type, either constitutively 
or after exposure to a certain exogenous stimulus (e.g., withdrawal of tetracycline or 
exposure to IPTG). 

Vectors may be introduced into the recipient or "host" cells by various 
methods well known in the art including, but not limited to, calcium phosphate 
transfection. strontium phosphate transfection. DEAE dextran transfection. 
electroporation, lipofection (e.g., Dosper Liposomal transfection reaeem, Boehringer 
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Mannheim, Germany), microinjection, ballistic insertion on micro-beads, protoplast 
fusion or, for viral or phage vectors, by infection with the recombinant virus or phage. 

5. Transgenic Animal Models 
5 The present invention also provides for the production of transgenic non- 

human animal models in which mutant or wild type PS-interacting protein sequences 
are expressed, or in which the PS-interacting protein genes have been inactivated 
(e.g., "knock-out" deletions), for the study of Alzheimer's Disease, for the screening of 
candidate pharmaceutical compounds, for the creation of explanted mammalian CNS 

10 cell cultures (e.g., neuronal, glial, organotypic or mixed cell cultures), and for the 
evaluation of potential therapeutic interventions. Prior to the present invention, a 
partial animal model for Alzheimer's Disease existed via the insertion and over- 
expression of a mutant form of the human amyloid precursor protein gene as a 
minigene under the regulation of the platelet-derived growth factor P receptor 

15 promoter element (Games et al., 1995). This mutant ((iAPP,,, Val-^Ile) causes the 
appearance of synaptic pathology and amyloid p peptide deposition in the brain of 
transgenic animals bearing this transgene in high copy number. These changes in the 
brain of the transgenic animal are very similar to that seen in human AD (Games et 
al., 1995). It is, however, as yet unclear whether these animals become demented, but 

20 there is general consensus that it is now possible to recreate at least some aspects of 
AD in mice. In addition, transgenic animal models in which the presenilin genes are 
genetically engineered are disclosed in PCT Publication WO96/34099. These 
transgenic animal models have been shown to have altered Ap production and altered 
hippocampus-dependent memory function. 

25 Animal species suitable for use in the animal models of the present 

invention include, but are not limited to, rats, mice, hamsters, guinea pigs, rabbits, 
dogs, cats, goats, sheep, pigs, and non-human primates (e.g.. Rhesus monkeys, 
chimpanzees). For initial studies, transgenic rodents (e.g., mice) may be preferred due 
to their relative ease of maintenance and shorter life spans. However, transgenic yeast 

30 or invertebrates (e.g., nematodes, insects) may be preferred for some studies because 
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they will allow for even more rapid and inexpensive screening. For example, 
invertebrates bearing mutant PS-interacting protein homologues (or mammalian PS- 
interacting protein transgenes) which cause a rapidly occurring and easily scored 
phenotype (e.g., abnormal vulva or eye development after several days) can be used as 

5 screens for drugs which block the effect of the mutant gene. Such invertebrates may 
prove far more rapid and efficient for mass screenings than larger vertebrate animals. 
Once lead compounds are found through such screens, they may be tested in higher 
animals such a rodents. Ultimately, transgenic non-human primates may be preferred 
for longer term studies due to their greater similarity to humans and their higher 

D cognitive abilities. 

Using the nucleic acids disclosed and otherwise enabled herein, there are 
now several available approaches for the creation of a transgenic animal model for 
Alzheimer's Disease. Thus, the enabled animal models include: (I) Animals in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 
sequences encoding at least a functional domain of a noimal human PS-interacting 
protein gene have been recombinantly substimted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
substitution of sequences encoding the human homologue by homologous 
recombination or gene targeting. These animals are useful for evaluating the effects 
of the transgenic procedures, and the effects of the introduction or substitution of a 
human or humanized PS-interacting protein gene. (2) Animals in which sequences 
encoding at least a functional domain of a mutant (i.e., pathogenic) human PS- 
interacting protein gene have been recombinantly introduced into the genome of the 
animal as an additional gene, under the regulation of either an exogenous or an 
endogenous promoter element, and as either a minigene or a large genomic fragment; 
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in which sequences encoding at least a functional domain of a mutant human PS- 
interacting protein gene have been recombinantly substituted for one or both copies of 
the animars homologous PS-interacting protein gene by homologous recombination 
or gene targeting; and/or in which one or both copies of one of the animal's 
5 homologous PS-interacting protein genes have been recombinantly "humanized" by 
the partial substitution of sequences encoding a mutant human homologue by 
homologous recombination or gene targeting. These animals are useful as models 
which will display some or all of the characteristics, whether at the biochemical, 
physiological and/or behavioral level, of humans carrying one or more alleles which 

10 are pathogenic of Alzheimer's Disease or other diseases associated with mutations in 
the PS-interacting protein genes. (3) Animals in which sequences encoding at least a 
functional domain of a mutant version of one of that animal's PS-interacting protein 
genes (bearing, for example, a specific mutation corresponding to, or similar to, one 
of the pathogenic mutations of the human PS-interacting proteins) have been 

15 recombinantly introduced into the genome of the animal as an additional gene, under 
the regulation of either an exogenous or an endogenous promoter element, and as 
either a minigene or a large genomic fragment; and/or in which sequences encoding at 
least a functional domain of a mutant version of one of that animal's PS-interacting 
protein genes (bearing, for example, a specific mutation corresponding to, or similar 

20 to, one of the pathogenic mutations of the human PS-interacting proteins) have been 
recombinantly substituted for one or both copies of the animal's homologous PS- 
interacting protein gene by homologous recombination or gene targeting. These 
animals are also useful as models which will display some or all of the characteristics, 
whether at the biochemical, physiological and/or behavioral level, of humans carrying 

25 one or more alleles which are pathogenic of Alzheimer's Disease. (4) "Knock-out" 
animals in which one or both copies of one of the animal's PS-interacting protein 
genes have been partially or completely deleted by homologous recombination or 
gene targeting, or have been inactivated by the insertion or substitution by 
homologous recombination or gene targeting of exogenous sequences (e.g., stop 

30 codons, lox p sites). Such animals are useful models to study the effects which loss of 
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PS-interacting protein gene expression may have, to evaluate whether loss of function 
is preferable to continued expression of mutant forms, and to examine whether other 
genes can be recruited to replace a mutant PS-interacting protein or to intervene with 
the effects of other genes (e.g., PSl, PS2, AFP or ApoE) causing AD as a treatment 
for AD or other disorders. For example, a normal PS-interacting protein gene may be 
necessary for the action of mutant presenilin or APP genes to actually be expressed as 
AD and, therefore, transgenic PS-interacting protein animal models may be of use in 
elucidating such multigenic interactions. 

In addition to transgenic animal models in which the expression of one or 
more of the PS-interacting proteins is altered, the present invention also provides for 
the production of transgenic animal models in which the expression of one or more of 
the presenilins. APP, or ApoE is altered. The nucleic acids encoding the presenilins. 
APP, and ApoE are known in the art, a methods for producing transgenic animals with 
these sequences are also known (see, e.g., PCT Publication WO96/34099; Games et 
al., 1 995). Indeed, because non-human animals may differ fiom humans not only in 
their PS-interacting protein sequences, but also in the sequences of their presenilin, 
APP and/or ApoE homologues, it is particularly contemplated that transgenics may be 
produced which bear recombinant nonmal or mutant human sequences for at least one 
presenilin, APP and/or ApoE gene in addition to recombinant sequences for one or 
more PS-interacting proteins. Such co-transfonned animal models would possess 
more elements of the human molecular biology and, therefore, are expected to be 
better models of human disorders. Thus, in accordance with the present invention, 
transgenic animal models may be produced bearing normal or mutant sequences for 
one or more PS-interacting proteins, or interacting domains of these proteins. These 
animals will have utility in that they can be crossed with animals bearing a variety of 
nonnal or mutant presenilin. APP or ApoE sequences to produce co-transformed 
animal models. Furthermore, as detailed below, it is expected that mutations in the 
PS-interacting genes, like mutations in the presenilins themselves, may be causative 
of Alzheimer's Disease and/or other disorders as well (e.g.. other cognitive, 
intellectual, neurological or psychological disorders such as cerebral hemorrhage. 
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schizophrenia, depression, mental retardation and epilepsy). Therefore, transgenic 
animal models bearing normal or mutant sequences corresponding to the PS- 
interacting proteins, absent transformation with any presenilin, APP or ApoE 
sequences, will have utility of their own in the study of such disorders. 
5 As detailed below, preferred choices for transgenic animal models 

transformed with PS-interacting proteins, or domains of PS-interacting proteins, 
include those transformed with normal or mutant sequences corresponding to the 
clones identified and described in Example 1 and disclosed in SEQ ED NOs: 1-12. 
These clones, which interact with normal or mutant PSl TM6->7 loop domains, were 
identified according to the methods described in Example 1 , below, and PCT 
Publication WO96/34099. These clones, longer nucleic acid sequences comprising 
these clones, and other clones identified according to this and other methods of the 
invention (e.g., allelic and splice variants or heterospecific homologues of these 
clones) may all be employed in accordance with the present invention to produce 
animal models which, with or without co-transformation with presenilin, APP and/or 
ApoE sequences, will have utility in the study of Alzheimer's Disease and/or other 
cognitive, intellectual, neurological or psychological disorders. 

Thus, using the nucleic acids disclosed and otherwise enabled herein, one 
of ordinary skill in the art may now produce any of the following types of transgenic 
animal models with altered PS-interacting protein expression: (1) Animals in which 
sequences encoding at least a fimctional domain of a normal human PS-interacting 
protein gene have been recombinantly introduced into the genome of the animal as an 
additional gene, imder the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic Augment; in which 
sequences encoding at least a fimctional domain of a nomial human PS-interacting 
protein gene have been recombinantly substituted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
substitution of sequences encoding the human homologue by homologous 
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recombination or gene targeting. These animals are particularly useful for providing 
transgenic models which express human PS-interacting proteins as well as human 
presenilin proteins. They are also useful in evaluating the effects of the transgenic 
procedures, and the effects of the introduction or substitution of a human or 
5 humanized PS-interacting protein gene. (2) Animals in which sequences encoding at 
least a functional domain of a mutant (i.e., pathogenic) human PS-interacting protein 
gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 
10 sequences encoding at least a functional domain of a mutant human PS-interacting 
protein gene have been recombinantly substituted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
15 substitution of sequences encoding a mutant human homologue by homologous 
recombination or gene targeting. These animals are useful as models which will 
display some or all of the characteristics, whether at the biochemical, physiological 
and/or behavioral level, of humans carrying one or more alleles which are pathogenic 
of Alzheimer's Disease or other diseases associated with mutations in these PS- 
mteracting genes. (3) Animals in which sequences encoding at least a functional 
domain of a mutant version of one of that animal's PS-interacting protein genes 
(bearing, for example, a specific mutation corresponding to, or similar to, one of the 
pathogenic mutations of the human PS-interacting proteins) have been recombinantly 
introduced into the genome of the animal as an additional gene, under the regulation 
25 of either an exogenous or an endogenous promoter element, and as either a minigene 
or a large genomic fragment; and/or in which sequences encoding at least a functional 
domain of a mutant version of one of that animal's PS-interacting protein genes 
(bearing, for example, a specific mutation corresponding to, or similar to, one of the 
pathogenic mutations of the humans PS-interacting proteins) have been recombinantly 
30 substimted for one or both copies of the animal's homologous PS-interacting protein 
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gene by homologous recombination or gene targeting. These animals are also useful 
as models which will display some or all of the characteristics, whether at the 
biochemical, physiological and/or behavioral level, of humans carrying one or more 
alleles which are pathogenic of Alzheimer's Disease. (4) "Knock-out" animals in 
5 which one or both copies of one of the animal's PS-interacting protein genes have 
been partially or completely deleted by homologous recombination or gene targeting, 
or have been inactivated by the insertion or substitution by homologous 
recombination or gene targeting of exogenous sequences (e.g., stop codons, lox p 
sites). Such animals are useful models to study the effects which loss of PS- 

10 interacting protein gene expression may have, to evaluate whether loss of function is 
preferable to continued expression, and to examine whether other genes can be 
recruited to replace a mutant PS-interacting protein or to intervene with the effects of 
other genes (e.g., APP or ApoE) causing AD as a treatment for AD or other disorders. 
For example, a normal PS-interacting protein may be necessary for the action of 

15 mutant PS 1 , PS2 or APP genes to actually be expressed as AD and, therefore, 

transgenic PS-interacting protein animal models may be of use in elucidating such 
multigenic interactions. 

. In some preferred embodiments, transgenic animal models are produced in 
which just the PS-interacting domains of the PS-interacting proteins are introduced 

20 into the genome of the animal by homologous recombination. Thus, for example, 
preferred embodiments include transgenic animals in which the PS-interacting 
domains of PS-interacting proteins are "humanized" by homologous recombination 
with sequences from human PS-interacting proteins. These animals may then be bred 
with transgenics in which normal or mutant presenilin sequences have been 

25 introduced. The progeny of these animals, having both human presenilin and human 
PS-interacting protein sequences, will provide improved animal models for 
Alzheimer's Disease. 

To create an animal model (e.g., a transgenic mouse), a normal or mutant 
PS-interacting gene (e.g., nonnal or mutant S5a, GT24, p0071 , Rabl 1 , etc.), or a 

30 normal or mutant version of a recombinant nucleic acid encoding at least a functional 
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domain of a PS-interacting gene (e.g.. the PS-interacting domains obtained in the 
yeast two-hybrid system), can be inserted into a germ line or stem cell using standard 
techniques of oocyte microinjection, or transfection or microinjection into embiyonic 
stem cells. Animals produced by these or similar processes are referred to as 
transgenic. Similarly, if it is desired to inactivate or iqjlace an endogenous presenilin 
or PS-interacting protein gene, homologous recombination using embryonic stem 
cells may be employed. Animals produced by these or similar processes are referred 
to as "knock-out" (inactivation) or "knock-in" (replacement) models. 

For oocyte injection, one or more copies of the recombinant DNA 
constructs of the present invention may be inserted into the pronucleus of a just- 
fertilized oocyte. This oocyte is then reimplanted into a pseudo-pregnant foster 
mother. The livebom animals are screened for integrants using analysis of DNA (e.g., 
from the tail veins of offspring mice) for the presence of the inserted recombinant 
transgene sequences. The transgene may be either a complete genomic sequence 
injected as a YAC. BAG, PAC or other chromosome DNA fragment, a cDNA with 
either the natural promoter or a heterologous promoter, or a minigene containing all of 
the coding region and other elements found to be necessary for optimum expression. 

Retroviral infection of early embryos can also be done to insert the 
recombinant DNA constructs of the invention. In this method, the transgene (e.g., a 
normal or mutant S5a, GT24. p0071, Rab II, etc., sequence) is inserted into a 
retroviral vector which is used to infect embryos (e.g.. mouse or non-human primate 
embryos) directly during the early stages of development to generate chimeras, some 
of which will lead to germline transmission. 

Homologous recombination using stem cells allows for the screening of 
25 gene transfer cells to identify the rare homologous recombination events. Once 
identified, these can be used to generate chimeras by injection of blastocysts, and a 
proportion of the resulting animals will show germline transmission from the 
recombinant line. This methodology is especially useful if inactivation of a gene is 
desired. For example, inactivation of the S5a gene in mice may be accomplished by 
designing a DNA fragment which contains sequences from an SSa coding region 
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flanking a selectable marker. Homologous recombination leads to the insertion of the 
marker sequences in the middle of the coding region, causing inactivation of the S5a 
gene and/or deletion of internal sequences. DNA analysis of individual clones can 
then be used to recognize the homologous recombination events. 
5 The techniques of generating transgenic animals, as well as the techniques 

for homologous recombination or gene targeting, are now widely accepted and 
practiced. A laboratory manual on the manipulation of the mouse embryo, for 
example, is available detailing standard laboratory techniques for the production of 
transgenic mice (Hogan et al., 1986). To create a transgene, the target sequence of 

10 interest (e.g., normal or mutant presenilin sequences, normal or mutant PS-interacting 
protein sequences) are typically ligated into a cloning site located downstream of 
some promoter element which will regulate the expression of RNA from the 
sequence. Downstream of the coding sequence, there is typically an artificial 
polyadenylation sequence. In the transgenic models that have been used to 

15 successfully create animals which mimic aspects of inherited human 

neurodegenerative diseases, the most successful promoter elements have been the 
platelet-derived growth factor recq)tor p gene subunit promoter and the hamster prion 
protein gene promoter, although other promoter elements which direct expression in 
central nervous system cells would also be useful. An alternate approach to creating a 

20 transgene is to use an endogenous presenilin or PS-interacting protein gene promoter 
and regulatory sequences to drive expression of the transgene. Finally, it is possible 
to create transgenes using large genomic DNA fragments such as YACs which 
contain the entire desired gene as well as its appropriate regulatory sequences. Such 
constructs have been successfully used to drive human APP expression in transgenic 

25 mice (Lamb et al., 1993). 

Animal models can also be created by targeting the endogenous presenilin 
or PS-interacting protein gene in order to alter the endogenous sequence by 
homologous recombination. These targeting events can have the effect of removing 
endogenous sequence (knock-out) or altering the endogenous sequence to create an 

30 amino acid change associated with hirnian disease or an otherwise abnormal sequence 
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(e.g., a sequence which is more like the human sequence than the original animal 
sequence) (knock-in animal models). A large number of vectors are available to 
accomplish this and appropriate sources of genomic DNA for mouse and other animal 
genomes to be targeted are commercially available from companies such as 
5 GenomeSystems Inc. (St. Louis. Missouri. USA). The typical feature of these 
targeting vector constructs is that 2 to 4 kb of genomic DNA is ligated 5' to a 
selectable marker (e.g., a bacterial neomycin resistance gene under its own promoter 
element termed a "neomycin cassette"). A second DNA fragment from the gene of 
interest is then ligated downstream of the neomycin cassette but upstream of a second 
10 selectable marker (e.g.. thymidine kinase). The DNA fragmems are chosen such that 
mutant sequences can be introduced imo the germ line of the targeted animal by 
homologous replaceraem of the endogenous sequences by either one of the sequences 
included in the vector. Alternatively, the sequences can be chosen to cause deletion of 
sequences that would normally reside between the left and right arms of the vector 
15 surrounding the neomycin cassette. The former is known as a knock-in, the latter is 
known as a knock-out. Again, innumerable model systems have been created, 
particularly for targeted knock-outs of genes including those relevant to 
neurodegenerative diseases (e.g.. targeted deletions of the murine APP gene by Zheng 
et al., 1995; targeted deletion of the murine prion gene associated with adult onset 
20 human CNS degeneration by Bueler et al.. 1 996). 

Finally, equivalents of transgenic animals, including animals with mutated 
or inactivated presenilin genes, or mutated or inactivated PS-interacting protein genes, 
may be produced using chemical or X-ray mutagenesis of gametes, followed by 
fertilization. Using the isolated nucleic acids disclosed or otherwise enabled herein. 
25 one of ordinary skill may more rapidly screen the resulting offspring by, for example, 
direct sequencing RFLP, PCR, or hybridization analysis to detect mutants, or 
Southern blotting to demonstrate loss of one allele by dosage. 

^- Assays for Drugs Which Affrrt PS-Interactina Protein Exnres^inn 
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In another series of embodiments, the present invention provides assays for 
identifying small molecules or other compounds which are capable of inducing or 
inhibiting the expression of the PS-interacting genes and proteins (e.g., S5a or GT24). 
The assays may be performed in vitro using non-transformed cells, immortalized cell 
5 lines, or recombinant cell lines, or in vivo using the transgenic animal models enabled 
herein. 

In particular, the assays may detect the presence of increased or decreased 
expression of S5a, GT24, p0071, Rab 1 1, or other PS-interacting genes or proteins on 
the basis of increased or decreased mRNA expression (using, e.g., the nucleic acid 

10 probes disclosed and enabled herein), increased or decreased levels of PS -interacting 
proteins (using, e.g., the anti-PS-interacting protein antibodies disclosed and enabled 
herein), or increased or decreased levels of expression of a marker gene (e.g., P- 
galactosidase or luciferase) operably joined to a PS-interacting protein 5' regulatory 
region in a recombinant construct. 

15 Thus, for example, one may culture cells known to express a particular PS- 

interacting protein and add to the culture medium one or more test compounds. After 
allowing a sufficient period of time (e.g., 0-72 hours) for the compound to induce or 
inhibit the expression of the PS-interacting protein, any change in levels of expression 
from an established baseline may be detected using any of the techniques described 

20 above and well known in the art. In particularly preferred embodiments, the cells are 
from an immortalized cell line such as a human neuroblastoma, glioblastoma or a 
hybridoma cell line. Using the nucleic acid probes and /or antibodies disclosed and 
enabled herein, detection of changes in the expression of a PS-interacting protein, and 
thus identification of the compound as an inducer or repressor of PS-interacting 

25 protein expression, requires only routine experimentation. 

In particularly preferred embodiments, a recombinant assay is employed in 
which a reporter gene such a (J-galactosidase, green fluorescent protein , alkaline 
phosphatase, or luciferase is operably joined to the 5' regulatory regions of a PS- 
interacting protein gene. Preferred vectors include the Green Lantern 1 vector 

30 (GIBCO/BRL, Gaithersburg, MD) and the Great EScAPe pSEAP vector (Clontech, 
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Palo Alto). The PS-interacting protein regulatory regions may be easily isolated and 
cloned by one of ordinary skill in the art in light of the present disclosure of coding 
regions from these genes. The reporter gene and regulatory regions are joined in- 
frame (or in each of the three possible reading frames) so that transcription and 
translation of the reporter gene may proceed under the control of the PS-interacting 
protein regulatory elements. The recombinant construct may then be introduced into 
any appropriate cell type, although mammalian cells are preferred, and human cells 
are most preferred. The transformed cells may be grown in culnire and, after 
establishing the baseline level of expression of the reporter gene, test compounds may 
be added to the medium. The ease of detection of the expression of the reporter gene 
provides for a rapid, high through-put assay for the identification of inducers and 
repressors of the PS-interacting protein gene. 

Compounds identified by this method will have potential utility in 
modifying the expression of the PS-interacting protein genes in vivo . These 
compounds may be further tested in the animal models disclosed and enabled herein 
to identify those compounds having the most potent in vivo effects. In addition, as 
described herein with respect to small molecules having binding activity for PS- 
interacting proteins, these molecules may serve as "lead compounds" for the fimher 
development of pharmaceuticals by, for example, subjecting the compounds to 
sequential modifications, molecular modeling, and other routine procedures employed 
in rational drug design. 



^- Mentificatipn of Compounds with PS-Interartin g Protein Binriinp r»^<,nit^, 

In light of the present disclosure, one of ordinary skill in the art is enabled 
to practice new screening methodologies which will be useful in the identification of 
proteins and other compounds which bind to, or otherwise directly interact with, the 
PS-interacting proteins. The proteins and compounds will include endogenous 
cellular components, aside from the presenilins, wWch interact with the PS-interacting 
proteins invivo and which, therefore, provide new targets for pharmaceutical and 
30 therapeutic interyemions, as well as recombinant, synthetic and otherwise exogenous 
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compounds which may have PS-interacting protein binding capacity and, therefore, 
may be candidates for pharmaceutical agents. Thus, in one series of embodiments, 
cell lysates or tissue homogenates (e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which bind to one of the 
5 normal or mutant PS-interacting proteins. Alternatively, any of a variety of 

exogenous compounds, both naturally occurring and/or synthetic (e.g., libraries of 
small molecules or peptides), may be screened for PS-interacting protein binding 
capacity. Small molecules are particular preferred in this context because they are 
more readily absorbed after oral administration, have fewer potential antigenic 

10 determinants, and/or are more likely to cross the blood brain barrier than larger 

molecules such as nucleic acids or proteins. The methods of the present invention are 
particularly useful in that they may be used to identify molecules which selectively or 
preferentially bind to a mutant form of a PS-interacting protein (rather than a normal 
form) and, therefore, may have particular utility in treating cases of AD which arise 

15 from mutations in the PS-interacting proteins. 

Once identified by the methods described above, the candidate compounds 
may then be produced in quantities sufficient for pharmaceutical administration or 
testing (e.g., ^g or mg or greater quantities), and formulated in a pharmaceutically 
acceptable carrier (see, e.g.. Remington's Pharmaceutical Sciences. Gennaro, A., ed., 

20 Mack Pub., 1 990). These candidate compounds may then be administered to the 

transformed cells of the invention, to the transgenic animal models of the invention, to 
cell lines derived from the animal models or from human patients, or to Alzheimer's 
patients. The animal models described and enabled herein are of particular utility in 
further testing candidate compounds which bind to normal or mutant PS-interacting 

25 proteins for their therapeutic efficacy. 

In addition, once identified by the methods described above, the candidate 
compounds may also serve as "lead compounds" in the design and development of 
new pharmaceuticals. For example, as in well known in the art, sequential 
modification of small molecules (e.g., amino acid residue replacement with peptides; 

30 functional group, replacement with peptide or non-peptide compounds) is a standard 
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approach in the phannaceutical industry for the development of new phannaceuticals. 
Such development generally proceeds from a "lead compound" which is shown to 
have at least some of the activity (e.g., PS-interacting protein binding or blocking 
ability) of the desired pharmaceutical. In particular, when one or more compounds 
having at least some activity of imerest (e.g., modulation of PS-interacting protein 
activity) are identified, structural comparison of the molecules can greatly inform the 
skilled practitioner by suggesting portions of the lead compounds which should be 
conserved and portions which may be varied in the design of new candidate 
compounds. Thus, the present invention also provides a means of idemifying lead 
compounds which may be sequentially modified to produce new candidate 
compounds for use in the treatment of Alzheimer's Disease. These new compounds 
then may be tested both for binding to PS-imeracting proteins and/or blocking PS- 
interacting protein activity, and for therapeutic efficacy (e.g., in the animal models 
described herein). This procedure may be iterated until compounds having the desired 
15 ther^eutic activity and/or efficacy are identified. 

In each of the present series of embodiments, an assay is conducted to 
detect binding between a "PS-interacting protein componem" and some other moiety. 
Of particular utility will be sequential assays in which compounds are tested for the 
ability to bind to only normal or only mutant fomis of the PS-interacting domains of 
PS-interacting proteins in the binding assays. Such compounds an expected to have 
the greatest therapeutic utilities, as described more fully below. The "PS-interacting 
protein component" in these assays may be a complete nonnal or mutant form of a 
PS-interacting protein (e.g.. S5a, GT24. pG071. Rab 1 1 . etc.) but need not be. Rather, 
particular fimctional domains of the PS-interacting proteins, particularly the PS- 
interacting domains as described above, may be employed either as separate 
molecules or as part of a fusion protein. For example, to isolate proteins or 
compounds that interact with these functional domains, screening may be carried out 
using fusion constructs and/or synthetic peptides corresponding to these regions. 
Thus, for S5a, GST-fusion peptides may be made including sequences corresponding 
approximately to.amino acids 70-377 of SEQ ID NO: 2 (included in clones Y2H29 
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and Y2H31, see Example 1), approximately to amino acids 206-377 of SEQ ID NO: 2 
(which includes protein-protein interaction motifs, see Ferrell et al., 1996), or to any 
other S5a domain of interest. Similarly, for GT24, GST- or other fusion peptides may 
be produced including sequences corresponding approximately to amino acids 440- 
5 815 of SEQ ID NO: 4 (including part of the armadillo repeat segment). Obviously, 
various combinations of fusion proteins and PS-interacting protein functional domains 
are possible and these are merely examples. In addition, the functional domains may 
be altered so as to aid in the assay by, for example, introducing into the functional 
domain a reactive group or amino acid residue (e.g., cysteine) which will facilitate 

10 immobilization of the domain on a substrate (e.g., using sulfhydryl reactions). Thus, 
for example, the PS-interacting domain of S5a may be synthesized containing an 
additional C-terminal cysteine residue to facilitate immobilization of the domain. 
Such peptides may be used to create an affinity substrate for affinity chromatography 
(Suifo-link; Pierce) to isolate binding proteins for microsequencing. Similarly, other 

15 functional domain or antigenic fiagments may be created with modified residues (see, 
e.g.. Example 4). 

The proteins or other compounds identified by these methods may be 
purified and characterized by any of the standard methods known in the art. Proteins 
may, for example, be purified and separated using electrophoretic (e.g., SDS-PAGE, 

20 2D PAGE) or chromatographic (e.g., HPLC) techniques and may then be 

microsequenced. For proteins with a blocked N-terminus. cleavage (e.g., by CNBr 
and/or trypsin) of the particular binding protein is used to release peptide firagments. 
Further purification/characterization by HPLC and microsequencing and/or mass 
spectrometry by conventional methods provides internal sequence data on such 

25 blocked proteins. For non-protein compounds, standard organic chemical analysis 
techniques (e.g., IR, NMR and mass spectrometry; functional group analysis; X-ray 
crystallography) may be employed to determine their structure and identity. 

Methods for screening cellular lysates, tissue homogenates, or small 
-molecule libraries for candidate PS-interaction protein-binding molecules are well 

30 known in the art and. in light of the present disclosure, may now be employed to 
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identify compounds which bind to normal or mutant PS-interacting protein 
components or which modulate PS-interacting protein activity as defined by non- 
specific measures (e.g., changes in intracellular Ca'\ GTP/GDP ratio) or by specific 
measures (e.g., changes in Ap peptide production or changes in the expression of 
5 other downstream genes which can be monitored by differential display, 2D gel 

electrophoresis, differential hybridization, or SAGE methods). The preferred methods 
involve variations on the following techniques: (1) direct extraction by affinity 
chromatography; (2) co-isolation of PS-interacting protein components and bound 
proteins or other compounds by immunoprecipitation; (3) the Biomolecular 
10 Interaction Assay (BIAcore); and (4) the yeast two-hybrid systems. These and others 
are discussed separately below. 

A. Affinity Chromatography 

In light of the present disclosure, a variety of affinity binding techniques 
well known in the art may be employed to isolate proteins or other compounds which 
15 bind to the PS-interacting protein disclosed or otherwise enabled herein. In general, a 
PS-interacting protein component may be immobilized on a substrate (e.g., a column 
or filter) and a solution including the test compound(s) is contacted with the PS- 
interacting protein, fiision or fragment under conditions which are pemiissive for 
binding. The substrate is then washed with a solution to remove unbound or weakly 
bound molecules. A second wash may then elute those compounds which strongly 
bound to the immobilized normal or mutant PS-interacting protein componem. 
Alternatively, the test compounds may be immobilized and a solution containing one 
or more PS-interacting protein components may be contacted with the column, filter 
or other substrate. The ability of the PS-interacting protein component to bind to the 
test compounds may be determined as above or a labeled form of the PS-imeracting 
protein component (e.g., a radio-labeled or chemiluminescem functional domain) may 
be used to more rapidly assess binding to the substrate-immobilized compound(s). 
B. Co-Immunoprecipi^ atirm 

Another well characterized technique for the isolation of PS-interacting 
protein components and their associated proteins or other compounds is direct 
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inununoprecipitation with antibodies. This procedure has been successfiilly used, for 
example, to isolate many of the syn^tic vesicle associated proteins (Phizicky and 
Fields, 1994). Thus, either nonnal or mutant, free or membrane-bound PS-interacting 
protein components may be mixed in a solution with the candidate compoxmd(s) 
5 under conditions which are permissive for binding, and the PS-interacting protein 
component may be immunoprecipitated. Proteins or other compoimds which co- 
inmiunoprecipitate with the PS-interacting protein component may then be identified 
by standard techniques as described above. General techniques for 
immunoprecipitation may be found in, for example, Harlow and Lane, (1988) 
^0 Antibodies: A Laborat ory Manual . Cold Spring Harbor Press, Cold Spring Harbor, 
NY. 

The antibodies employed in this assay, as described and enabled herein, 
may be polyclonal or monoclonal, and include the various antibody fragments (e.g.. 
Fab, F(ab*)2,) as well as single chain antibodies, and the like. 

15 C. The Biomolecular Interaction Assay 

Another usefiil method for the detection and isolation of binding proteins 
is the Biomolecular Interaction Assay or "BL\.core" system developed by Pharmacia 
Biosensor and described in the manufacturer's protocol (LKB Pharmacia, Sweden). In 
light of the present disclosure, one of ordinary skill in the art is now enabled to 

20 employ this system, or a substantial equivalent, to identify proteins or other 

compounds having PS-interacting protein binding capacity. The BL^core system uses 
an affinity piuified anti-GST antibody to immobilize GST-fusion proteins onto a 
sensor chip. Obviously, other fusion proteins and corresponding antibodies may be 
substituted. The sensor utilizes surface plasmon resonance which is an optical 

25 phenomenon that detects changes in refractive indices. A homogenate of a tissue of 
interest is passed over the inunobilized fusion protein and protein-protein interactions 
are registered as changes in the refractive index. This system can be used to 
determine the kinetics of binding and to assess whether any observed binding is of 
physiological relevance. 
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The Yeast T wo-HvhriH .Sy ctPm 
The yeast "two-hybrid" system takes advantage of transcriptional factors 
that are composed of two physically separable, functional domains (Phizicky and 
Fields, 1994). The most commonly used is the yeast GAL4 transcriptional activator 
5 consisting of a DNA binding domain and a transcriptional activation domain. Two 
different cloning vectors are used to generate separate fusions of the GAL4 domains 
to genes encoding potential binding proteins. The fusion proteins are co-expressed, 
targeted to the nucleus and. if interactions occur, activation of a reporter gene (e.g.. 
lacZ) produces a detectable phenotype. For example, the Clontech Matchmaker 
10 System-2 may be used with the Clontech brain cDNA GAL4 activation domain fusion 
library with PS-interacting pn,tein-GAL4 binding domain fusion clones (Clontech. 
Palo Alto. CA). In light of the disclosures herein, one of ordinal skill in the art is 
now enabled to produce a variety of PS-interacting protein fusions, including fusions 
including either nonnal or mutant functional domains of the PS-inteiacting proteins. 
15 and to screen such fusion libraries in order to identify PS-interacting protein binding 
proteins. 

E. Other Methods 

The nucleotide sequences and protein products, including both mutant and 
normal fomis of these nucleic acids and their corresponding proteins, can be used with 
the above techniques to isolate other interacting proteins, and to identify other genes 
whose expression is altered by the over-expression of normal PS-interacting protein 
sequences, by die under-expression of nomtal PS-interacting protein sequences, or by 
the expression of mutant PS-interacting protein sequences. Identification of these 
other interacting proteins, as well as the identification of other genes whose 
25 expression levels are alters in AD will identify other gene targets which have direct 
relevance to the pathogenesis of this disease in its clinical or pathological forms. 
Specifically, other genes will be identified which may themselves be the site of other 
mutations causing Alzheimer's Disease, or which can themselves be targeted 
therapeutically (e.g.. to reduce their expression levels to nomial. or to 
30 pharmacologically block the effects of their over-expression) as a potential treatment 
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for this disease. Specifically, these techniques rely on PCR-based and/or 
hybridization-based methods to identify genes which are differentially expressed 
between two conditions (a cell line expressing normal PS-interacting proteins 
compared to the same cell type expressing a mutant PS-interacting protein). These 
5 techniques include differential display, serial analysis of gene expression (SAGE), and 
mass-spectrometry of protein 2D-gels and subtractive hybridization (reviewed in 
Nowak, 1995 and Kahn, 1995). 

As will be obvious to one of ordinary skill in the art, there are numerous 
other methods of screening individual proteins or other compounds, as well as large 

10 libraries of proteins or other compounds (e.g., phage display libraries and cloning 

systems from Stratagene, La Jolla, CA) to identify molecules which bind to normal or 
mutant PS-interacting protein components. All of these methods comprise the step of 
mixing a normal or mutant PS-interacting protein, fusion, or fragment with test 
compounds, allowing for binding (if any), and assaying for bound complexes. All 

15 such methods are now enabled by the present disclosure of substantially pure PS- 
interacting proteins, substantially pure PS-interacting functional domain fragments, 
PS-interacting protein fusion proteins, PS-interacting protein antibodies, and methods 
of making and using the same. 

20 8. Disrupting PS-Interacting Protein Interactions 

The ability to disrupt specific interactions of the PS-interacting proteins 
with the presenilins, or with other proteins, is potentially of great therapeutic value, 
and will be important in understanding the etiology of AD and in identifying 
additional targets for ther^y. The methods used to identify compounds which disrupt 

25 PS-interacting protein interactions may be applied equally well to interactions 
involving either normal or mutant PS-interacting proteins. 

Assays for compounds which can disrupt PS-interacting protein 
interactions may be performed by any of a variety of methods well known in the art. 
In essence, such assays will parallel those assays for identifying proteins and 

30 compounds with binding activity toward the PS-interacting proteins. Thus, once a 
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compound with binding activity for a PS-interacting protein is identified by any 
method, that method or an equivalent method may be perfomed in the presence of 
candidate compounds to identify compounds which disrupt the interaction. Thus, for 
example, the assay may employ methods including (1) affinity chromatography; (2) 
5 iramunoprecipitation; (3) the Bioraolecular Interaction Assay (BIAcore); or (4) the 
yeast two-hybrid systems. Such assays can be developed using either normal or 
mutant purified PS-interacting proteins, and/or either normal or mutant purified 
binding proteins (e.g., normal or mutant presenilins). 

For affinity methods, either the PS-interacting protein or its binding 
10 partner may be affixed to a matrix, for example in a column, and the counterpart 
protein (e.g., the PS-interacting protein if presenilin or another binding partner is 
affixed to the matrix; or a preseniUn or other binding partner if the PS-interacting 
protein is affixed to the matrix) is then exposed to the affixed protein/compound either 
before or after adding the candidate compound(s). In the absence of a disruptive 
15 effect by the candidate compound(s), the interaction between the PS-interacting 

protein and its binding partner will cause the counterpart protein to bind to the affixed 
protein. Any compound which disrupts the interaction will cause release of the 
counterpart protein from the matrix. Release of the counterpart protein from the 
matrix can be measured using methods known in the art. 

PS-interacting protein interactions which are detectable by yeast two- 
hybrid systems, these assays may also be employed to identify compounds which 
dismpt the interaction. Briefly, a PS-interacting protein and its binding partner (or 
appropriate structural domains of each) are employed in the fusion proteins of the 
system, and the cells are exposed to candidate compounds to determine their effect 
25 upon the expression of the reporter gene. By appropriate choice of a reporter gene, 
such a system can be readily adapted for high through-put screening of large libraries 
of compounds by. for example, using a reporter gene which confers resistance to an 
antibiotic which is present in the medium, or which rescues an auxotrophic strain 
grown in minimal medium. 
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These assays may be used to screen many different types of compounds for 
their disruptive effect on the interactions of the PS-interacting proteins. For example, 
the compounds may belong to a library of synthetic molecules, or be specifically 
designed to disrupt the interaction. The compounds may also be peptides 
5 corresponding to the interacting domain of either protein. This type of assay can be 
used to identify compounds that disrupt a specific interaction between a given PS- 
interacting protein variant and a given binding partner. In addition, compounds that 
disrupt all interactions with PS-interacting proteins may be identified. For example, a 
compound that specifically disrupts the folding of PS-interacting proteins would be 
10 expected to disrupt all interactions between PS-interacting proteins and other proteins. 
Alternatively, this type of disruption assay can be used to identify compounds which 
disrupt only a range of different PS-interacting protein interactions, or only a single 
PS-interacting protein interaction. 

15 9. Methods of Identifying Compounds Modulating PS-Interacting Protein Activitv 
In another series of embodiments, the present invention provides for 
methods of identifying compounds with the ability to modulate the activity of nomial 
and mutant PS-interacting proteins. As used with respect to this series of 
embodiments, the tenn "activity" broadly includes gene and protein expression, PS- 

20 interacting protein post-translation processing, trafficking and localization, and any 
functional activity (e.g., enzymatic, receptor-effector, binding, channel), as well as 
downstream affects of any of these. It is known that Alzheimer's Disease is associated 
with increased production of the long form of AP peptides, the appearance of amyloid 
plaques and neurofibrillary tangles, decreases in cognitive fimction, and apoptotic cell 

25 death. Therefore, using the transformed cells and transgenic animal models of the 
present invention, cells obtained from subjects bearing normal or mutant PS- 
interacting protein genes, or animals or human subjects bearing naturally occurring 
normal or mutant PS-interacting proteins, it is now possible to screen candidate 
pharmaceuticals and treatments for their therapeutic effects by detecting changes in 
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one or more of these functional characteristics or phenotypic manifestations of normal 
or mutant PS-interacting protein expression. 

Thus, the present invention provides methods for screening or assaying for 
proteins, small molecules or other compounds which modulate PS-interacting protein 
5 activity by contacting a cell in vivo or in vitro with a candidate compound and 
assaying for a change in a marker associated with normal or mutant PS-interacting 
protein activity. The marker associated with PS-inteiacting protein activity may be 
any measurable biochemical, physiological, histological and/or behavioral 
characteristic associated with PS-interacting protein expression." In particular, useful 
10 markers will include any measurable biochemical, physiological, histological and/or 
behavioral characteristic which distinguishes cells, tissues, animals or individuals 
bearing at least one mutant presenilin or PS-interacting protein gene fiom their normal 
counterparts. In addition, the marker may be any specific or non-specific measure of 
presenilin or PS-interacting protein activity. PS-interacting protein specific measures 
15 include measures of PS-interacting protein expression (e.g., PS-interacting protein 
mRNA or protein levels) which may employ the nucleic acid probes or antibodies of 
the present invention. Non-specific measures include changes in cell physiology such 
as pH, intracellular calcium, cyclic AMP levels, GTP/GDP ratios, 
phosphatidylinositol activity, protein phosphorylation, etc., which can be monitored 
20 on devices such as the cytosensor microphysiometer (Molecular Devices Inc., United 
States). The activation or inhibition of PS-interacting protein activity in its mutant or 
normal form can also be monitored by examining changes in the expression of other 
genes (e.g., the presenilins) which are specific to the PS-interacting protein pathway 
leading to Alzheimer's Disease. These can be assayed by such techniques as 
25 differential display, differential hybridization, and SAGE (sequential analysis of gene 
expression), as well as by two dimensional gel electi-ophoresis of cellular lysates. In 
each case, the differentially-expressed genes can be ascertained by inspection of 
identical studies before and after application of the candidate compound. 
Furthermore, as noted elsewhere, the particular genes whose expression is modulated 
by the administration of the candidate compound can be ascertained by cloning. 
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nucleotide sequencing, amino acid sequencing, or mass spectrometry (reviewed in 
Nowak, 1995). 

In general, a cell may be contacted with a candidate compound and, after 
an ^propriate period (e.g., 0-72 hours for most biochemical measures of cultured 
5 cells), the marker of presenilin or PS-interacting protein activity may be assayed and 
compared to a baseline measurement. The baseline measurement may be made prior 
to contacting the cell with the candidate compound or may be an external baseline 
established by other experiments or known in the art. The cell may be a transformed 
cell of the present invention or an explant from an animal or individual. In particular, 

10 the cell may be an explant from a carrier of a presenilin or PS-interacting protein 

mutation (e.g., a himian subject with Alzheimer*s Disease) or an animal model of the 
invention (e.g., a transgenic nematode or mouse bearing a mutant presenilin or PS- 
interacting protein gene). To augment the effect of presenilin or PS-interacting 
protein mutations on the Ap pathway, transgenic cells or animals may be employed 

15 which have increased Ap production. Preferred cells include those from neurological 
tissues such as neuronal, glial or mixed cell cultures; and cultured fibroblasts, liver, 
kidney, spleen, or bone marrow. The cells may be contacted with the candidate 
compounds in a culture in vitro or may be administered in vivo to a live animal or 
human subject. For live animals or human subjects, the test compound may be 

20 administered orally or by any parenteral route suitable to the compound. For clinical 
trials of human subjects, measurements may be conducted periodically (e.g., daily, 
weekly or monthly) for several months or years. 

Because most individuals beaing a mutation in a particular gene are 
heterozygous at that locus (i.e., bearing one normal and one mutant allele), 

25 compounds may be tested for their ability to modulate normal as well as mutant 
presenilin or PS-interacting protein activity. Thus, for example, compounds which 
enhance the function of normal presenilins or PS-interacting proteins may have utility 
in treating Alzheimer's Disease or related disorders. Alternatively, because 
suppression of the activity of both normal and mutant copies of a gene in a 

30 heterozygous individual may have less severe clinical consequences than progression 
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of the associated disease, it may be desired to identify compound which inactivate or 
suppress all forms of the presenilins, the PS-interacting proteins, or their interactions. 
Preferably, however, compounds are identified which selectively or specifically 
inactivate or suppress the activity of mutant presenilin or PS-interacting proteins 
5 without disrupting the function of their normal counterparts. 

In light of the identification, characterization, and disclosure herein of a 
novel group of PS-interacting genes and proteins, tiie PS-interacting protein nucleic 
acid probes and antibodies, and the PS-interacting protein transfoimed cells and 
transgenic animals of the invention, one of ordinary skill in the art is now enabled by 
10 perform a great variety of assays which will detect the modulation of presenilin and/or 
PS-interacting protein activity by candidate compounds. Particularly preferred and 
contemplated embodiments are discussed in some detail below. 

A. PS-Interactine Protein Expression 

In one series of embodiments, specific measures of PS-interacting protein 
15 expression are employed to screen candidate compounds for their ability to affect 

presenilin activity. Thus, using the PS-interacting protein nucleic acids and antibodies 
disclosed and otherwise enabled herein, one may use mRNA levels or protein levels 
as a marker for the ability of a candidate compound to modulate PS-interacting 
protein activity. The use of such probes and antibodies to measure gene and protein 
expression is well known in the art and discussed elsewhere herein. Of particular 
interest may be the identification of compounds which can alter the relative levels of 
different variants (e.g., mutant and normal) of the PS-interacting proteins. 

B. Intracellular Localization 

In another series of embodiments, compounds may be screened for their 
ability to modulate the activity of the PS-interacting proteins based upon their effects 
on the trafficking and intracellular localization of the PS-interacting proteins. The 
presenilins and some of the PS-intetacting proteins (e.g., S5a) have been seen 
immunocytochemically to be localized in membrane structures associated with the 
endoplasmic reticulum and Golgi apparatus. Differences in localization of mutant and 
normal presenilins or PS-interacting proteins may, therefore, contribute to the etiology 
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of Alzheimer's Disease and related disorders. Compounds which can affect the 
localization of these proteins may, therefore, be identified as potential therapeutics. 
Standard techniques known in the art may be employed to detect the localization of 
the presenilins and PS-interacting proteins. Generally, these techniques will employ 
5 the antibodies of the present invention, and in particular antibodies which selectively 
bind to one or more mutant PS-interacting proteins but not to normal proteins. As is 
well known in the art, such antibodies may be labeled by any of a variety of 
techniques (e.g., fluorescent or radioactive tags, labeled secondary antibodies, avidin- 
biotin, etc.) to aid in visualizing the intracellular location of these proteins. The PS- 

10 interacting proteins may be co-localized to particular structures, as in known in the 
art, using antibodies to markers of those structures (e.g., TGN38 for the Golgi, 
transferrin receptor for post-Golgi transport vesicles, LAMP2 for lysosomes). 
Western blots of purified fractions from ceil lysates enriched for different intracellular 
membrane bound organelles (e.g., lysosomes, synaptosomes, Golgi) may also be 

15 employed. 

B. Ion Regulation/Metabolism 

In another series of embodiments, compoimds may be screened for their 
ability to modulate the activity of the presenilins or PS -interacting proteins based 
upon measures in intracellular Ca^*, Na* or K* levels or metabolism. As noted above, 

20 the presenilins are membrane associated proteins which may serve as, or interact with, 
ion receptors or ion channels. Thus, compounds may be screened for their ability to 
modulate presenilin and PS-interacting protein-related metabolism of calcium or other 
ions either in vivo or in vitro by, for example, measurements of ion channel fluxes 
and/or transmembrane voltage and/or current fluxes, using patch clamps, voltage 

25 clamps or fluorescent dyes sensitive to intracellular ion levels or transmembrane 
voltage. Ion charmel or receptor function can also be assayed by measurements of 
activation of second messengers such as cyclic AMP, cGMP tyrosine kinases, 
phosphates, increases in intracellular Ca^^ levels, etc. Recombinantly made proteins 
may also be reconstructed in artificial membrane systems to study ion channel 

30 conductance and, therefore, the "cell" employed in such assays may comprise an 
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artificial membrane or cell. Assays for changes in ion regulation or metabolism can 
be performed on cultured cells expressing endogenous normal or mutant presenilins 
and PS-interacting proteins. Such shidies also can be performed on cells transfected 
with vectors capable of expressing one of the presenilins or PS-interacting proteins, or 
5 functional domains of one of the presenilins or PS-interacting proteins, in normal or 
mutant form. In addition, to enhance the signal measured in such assays, cells may be 
co-transfected with genes encoding ion channel proteins. For example, Xenopus 
oocytes or rat kidney (HEK293) cells may be co-transfected with sequences encoding 
rat brain Na* p 1 subunits, rabbit skeletal muscle Ca^* p 1 subunits. or rat heart p 1 
10 subunits. Changes in presenilin or PS-interacting protein-mediated ion channel 
activity can be measured by. for example, two-microelcctrode voltage-clamp 
recordings in oocytes, by whole-cell patch-clamp recordings in HEK293 cells, or by 
equivalent means. 

C. ApoDtosis or Cell Death 

In another series of embodiments, compounds may be screened for their 
ability to modulate the activity of the presenilins or PS-interacting proteins based 
upon their effects on presenilin or PS-interacting protein-related apoptosis or cell 
death. Thus, for example, baseline rates of apoptosis or cell death may be established 
for cells in culture, or the baseline degree of neuronal loss at a particular age may be 
20 established post-mortem for animal models or human subjects, and the ability of a 
candidate compound to suppress or inhibit apoptosis or cell death may be measured. 
Cell death may be measured by standard microscopic techniques (e.g., light 
microscopy) or apoptosis may be measured more specifically by characteristic nuclear 
morphologies or DNA fragmentation patterns which create nucleosomal ladders (see, 
e.g., Gavrieli et al., 1992; Jacobson et al., 1993; Vito et al., 1996). TUNEL may also 
be employed to evaluate cell death in brain (see, e.g.. Lassmann et al.. 1995). In 
preferred embodiments, compounds are screened for their ability to suppress or inhibit 
neuronal loss in the transgenic animal models of the invention. Transgenic mice 
bearing, for example, a mutant human, mutant mouse, or humanized mutant presenilin 
or PS-interacting protein gene may be employed to identify or evaluate compounds 
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which may delay or arrest the neurodegeneration associated with Alzheimer's 
Disease. A similar transgenic mouse model, bearing a mutant APP gene, has recently 
been reported by Games et al. (1995). 

D. Ap Peptide Production 
5 In another series of embodiments, compounds may be screened for their 

ability to modulate presenilin or PS-interacting protein-related changes in APP 
processing. The Ap peptide is produced in several isoforms resulting from differences 
in APP processing. The Ap peptide is a 39 to 43 amino acid derivative of PAPP 
which is progressively deposited in difRise and senile plaques and in blood vessels of 

10 subjects with AD. In human brain, AP peptides are heterogeneous at both the N- and 
C-termini. Several observations, however, suggest that both the full length and N- 
terminal truncated forms of the long-tailed AP peptides ending at residue 42 or 43 
(i.e., Ap 1-42/43 and APx-42/43) have a more important role in AD than do peptides 
ending at residue 40. Thus, Ap 1-42/43 and APx-42/43 are an early and prominent 

15 feature of both senile plaques and diffuse plaques, while peptides ending at residue 40 
(i.e., Api-40 and APx-40) are predominantly associated with a subset of mature 
plaques and with amyloidotic blood vessels (see, e.g., Iwatsubo et al., 1995; Gravina 
et al., 1995; Tamaoka et al., 1995; Podlisny et al. 1995). Furthermore, the long-tailed 
isoforms have a greater propensity to fibril formation, and are thought to be more 

20 neurotoxic than Apl-40 peptides (Pike et al, 1993; Hilbich et al., 1991). Finally, 
missense mutations at codon 717 of the PAPP gene are associated with early onset 
FAD, and resuU in overproduction of long-tailed Ap in the brain of affected mutation 
carriers, in peripheral cells and plasma of both affected and presymptomatic carriers, 
and in cell lines transfected with pAPP,,, mutant cDNAs (Tamaoka et al., 1994; 

25 Suzuki etal., 1994). 

Thus, in one series of embodiments, the present invention provides 
methods for screening candidate compounds for their ability to block or inhibit the 
increased production of long isoforms of the Ap peptides in cells or transgenic 
animals expressing a normal or mutant presenilin gene and/or a normal or mutant PS- 

30 interacting protein gene. In particular, the present invention provides such methods in 
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which cultured mammalian cells, such as brain cells or fibroblasts, have been 
transformed according to the methods disclosed herein, or in which transgenic 
animals, such as rodents or non-human primates, have been produced by the methods 
disclosed herein, to express relatively high levels of a normal or mutant presenilin or 
5 PS-interacting protein. Optionally, such cells or transgenic animals may also be 
transformed so as to express a nonnal or mutant fonn of the pAPP protein at 
relatively high levels. 

In this series of embodiments, the candidate compound is administered to 
the cell line or transgenic animals (e.g.. by addition to the media of cells in culture; or 
30 by oral or parenteral administration to an animal) and, after an appropriate period 
(e.g., 0-72 hours for cells in culture, days or months for animal models), a biological 
sample is collected (e.g., cell culture supernatant or cell lysate from cells in culture; 
tissue homogenate or plasma from an animal) and tested for the level of the long 
isofonns of the AP peptides. The levels of the peptides may be determined in an 
15 absolute sense (e.g.. nMol/ml) or in a relative sense (e.g., ratio of long to short Ap 
isoforms). The Ap isofonns may be detected by any means known in the art (e.g.. 
electrophoretic separation and sequencing) but, preferably, antibodies which are 
specific to the long isofonn are employed to detennine the absolute or relative levels 
of the APM2/43 or APx-42/43 peptides. Candidate phamaceuticals or therapies 
which reduce the absolute or relative levels of these long Ap isoforms, particularly in 
the transgenic animal models of the invention, are likely to have therapeutic utility in 
the treatment of Alzheimer's Disease, or other disoniers caused by mutations in the 
presenilins or PS-interacting proteins, or by other aberrations in APP metabolism. 
Phosphorylation of Mifimt ubule Assnriated Proteins 
In another series of embodiments, candidate compounds may be screened 
for their ability to modulate presenilin or PS-interacting protein activity by assessing 
the effect of the compound on levels of phosphorylation of microtubule associated 
proteins (MAPs) such as tau. The abnomal phosphorylation of tau and other MAPs 
in the brains of victims of Alzheimer's Disease is well known in the art. Thus, 
compounds which prevent or inhibit the abnormal phosphorylation of MAPs may 
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have utility in treating presenilin or PS-interacting protein-associated diseases such as 
AD. As above, cells from normal or mutant animals or subjects, or the transformed 
cell lines and animal models of the invention may be employed. Prefenred assays will 
employ cell lines or animal models transformed with a mutant human or humanized 
5 mutant presenilin or PS-interacting protein gene. The baseline phosphorylation state 
of MAPs in these cells may be established and then candidate compounds may be 
tested for their ability to prevent, inhibit or counteract the hypeiphosphorylation 
associated with mutants. The phosphorylation slate of the MAPs may be determined 
by any standard method known in the art but, preferably, antibodies which bind 
10 selectively to phosphorylated or unphosphorylated epitopes arc employed. Such 
antibodies to phosphorylation epitopes of the tau protein are known in the art (e.g., 
ALZ50). 

10. Screening and Diagnostics for Alzheimer's Disease 

15 A. General Diagnostic Methods 

The PS-interacting genes and gene products, as well as the PS-interacting 
protein derived probes, primers and antibodies, disclosed or otherwise enabled herein, 
are useful in the screening for carriers of alleles associated with Alzheimer's Disease, 
for diagnosis of victims of Alzheimer's Disease, and for the screening and diagnosis of 

20 related presenile and senile dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral hemorriiage, all of 
which are seen to a greater or lesser extent in symptomatic human subjects bearing 
mutations in the PS 1 or PS2 genes or in the APP gene. Individuals at risk for 
Alzheimer's Disease, such as those with AD present in the family pedigree, or 

25 individuals not previously known to be at risk, may be routinely screened using 

probes to detect the presence of a mutant PS-interacting protein gene or protein by a 
variety of techniques. Diagnosis of inherited cases of these diseases can be 
accomplished by methods based upon the nucleic acids (including genomic and 
mRNA/cDNA sequences), proteins, and/or antibodies disclosed and enabled herein, 

30 including functional assays designed to detect failure or augmentation of the normal 
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presenilin or PS-interacting protein activity and/or the presence of specific new 
activities confened by mutant PS-interacting proteins. Preferably, the methods and 
products are based upon the human nucleic acids, proteins or antibodies, as disclosed 
or otherwise enabled herein. As will be obvious to one of ordinary skill in the art. 
however, the significant evolutionary conservation of large portions of nucleotide and 
amino acid sequences, even in species as diverse as humans, mice, C. elegans. and 
^°'°Ph"^ the skilled artisan to make use of non-human homologues of the 
PS-imeracting proteins to produce usefiil nucleic acids, proteins and antibodies, even 
for applications directed toward human or other animal subject's, thus, for brevity of 
exposition, but without limiting the scope of the invention, the following description 
will focus upon uses of the human homologues of PS-interacting proteins and genes. 
It will be understood, however, that homologous sequences from other species will be 
equivalent for many purposes. 

As will be appreciated by one of ordinary skill in the art, the choice of 
diagnostic methods of the present invention will be influenced by the nature of the 
available biological samples to be tested and the nature of the information required. 
Alzheimer's Disease is, of course, primarily a disease of the brain, but brain biopsies 
are invasive and expensive procedures, particularly for routine screening. Other 
tissues which express the presenilins or PS-interacting proteins at significant levels 
20 may, therefore, be preferred as sources for samples. 

Protein Ba sed Screens and Diapn^ stirc 
When a diagnostic assay is to be based upon PS-interacting proteins, a variety 
of approaches are possible. For example, diagnosis can be achieved by monitoring 
differences in the electropboretic mobility of normal and mutant proteins. Such an 
approach will be particularly useful in identifying mutants in which charge 
substitutions are present, or in which insertions, deletions or substimtions have 
resulted in a significant change in the electropboretic migration of the resultant 
protein. Alternatively, diagnosis may be based upon differences in the proteolytic 
cleavage patterns of normal and mutant proteins, differences in molar ratios of the 
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various amino acid residues, or by functional assays demonstrating altered function of 
the gene products. 

In preferred embodiments, protein-based diagnostics will employ differences 
in the ability of antibodies to bind to normal and mutant PS-interacting proteins. Such 
5 diagnostic tests may employ antibodies which bind to the normal proteins but not to 
mutant proteins, or vice versa. In particular, an assay in which a plurality of 
monoclonal antibodies, each capable of binding to a mutant epitope, may be 
employed. The levels of anti-mutant antibody binding in a sample obtained from a 
test subject (visualized by, for example, radiolabelling, ELISA or chemiluminescence) 

10 may be compared to the levels of binding to a control sample. Alternatively, 
antibodies which bind to normal but not mutant proteins may be employed, and 
decreases in the level of antibody binding may be used to distinguish homozygous 
normal individuals from mutant heterozygotes or homozygotes. Such antibody 
diagnostics may be used for in situ inmiunohistochemistry using biopsy samples of 

15 CNS tissues obtained antemortem or postmortem, including neuropathological 

structures associated with these diseases such as neurofibrillary tangles and amyloid 
plaques, or may be used with fluid samples such a cerebrospinal fluid or with 
peripheral tissues such as white blood cells. 

C, Nucleic Acid Based Screens and Diagnostics 

20 When the diagnostic assay is to be based upon nucleic acids from a sample, 

the assay may be based upon mRNA, cDNA or genomic DNA. When mRNA is used 
from a sample, there are considerations with respect to source tissues and the 
possibility of alternative splicing. That is, there may be little or no expression of 
transcripts unless appropriate tissue sources are chosen or available, and alternative 

25 splicing may result in the loss of some information or difficulty in interpretation. 

Whether mRNA, cDNA or genomic DNA is assayed, standard methods well known in 
the ait may be used to detect the presence of a particular sequence either in situ or in 
vitro (see, e.g., Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual, 
2nd ed.. Cold Spring Harbor Press, Cold Spring Harbor, NY). As a general matter, 

30 however, any tissue with nucleated cells may be examined. 
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Genomic DNA used for the diagnosis may be obtained from body cells, such 
as those present in the blood, tissue biopsy, surgical specimen, or autopsy material. 
The DNA may be isolated and used directly for detection of a specific sequence or 
may be amplified by the polymerase chain reaction (PCR) prior to analysis. 
5 Similarly, RNA or cDNA may also be used, with or without PCR amplification. To 
detect a specific nucleic acid sequence, direct nucleotide sequencing, hybridization 
using specific oligonucleotides, restriction enzyme digest and mapping, PCR 
mapping, RNase protection, chemical mismatch cleavage, ligase-mediated detection, 
and various other methods may be employed. Oligonucleotides specific to particular 
10 sequences can be chemically synthesized and labeled radioactively or non- 

radioactively (e.g., biotin tags, ethidium bromide), and hybridized to individual 
samples immobilized on membranes or other solid-supports (e.g., by dot-blot or 
transfer from gels after electrophoresis), or in solution. The presence or absence of 
the target sequences may then be visualized using methods such as autoradiography, 
15 fluorometry, or colorimetry. These procedures can be automated using redundant, 
short oligonucleotides of known sequence fixed in high density to silicon chips. 
(1) Appropriate Probes and Primers 
Whether for hybridization, RNase protection, ligase-mediated detection, PCR 
amplification or any other standards methods described herein and well known in the 
20 art, a variety of subsequences of the PS-interacting protein sequences disclosed or 
otherwise enabled herein will be usefiil as probes and/or primers. These sequences or 
subsequences will include both normal sequences and deleterious mutant sequences. 
In general, usefiil sequences will include at least 8-9, more preferably 10-50, and most 
preferably 18-24 consecutive nucleotides from introns, exons or intron/exon 
25 boundaries. Depending upon the target sequence, the specificity required, and fiiture 
technological developments, shorter sequences may also have utility. Therefore, any 
PS-interacting protein derived sequence which is employed to isolate, clone, amplify, 
identify or otherwise manipulate a PS-interacting protein sequence may be regarded as 
an appropriate probe or primer. Particularly contemplated as useftil will be sequences 
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including nucleotide positions from the PS-interacting protein genes in which disease- 
causing mutations are known to be present, or sequences which flank these positions. 
(2) Hybridization Screeninp 
For insitu detection of a normal or mutant PS-interacting protein-related 
nucleic acid sequence, a sample of tissue may be prepared by standard techniques and 
then contacted with one or more of the above-described probes, preferably one which 
is labeled to facilitate detection, and an assay fornucleic acid hybridization is 
conducted under stringent conditions which permit hybridization only between the 
probe and highly or perfectly complementary sequences. Because many mutations 
consist of a single nucleotide substitution, high stringency hybridization conditions 
may be required to distinguish normal sequences from most mutant sequences. When 
the PS-interacting protein genotypes of the subject's parents are known, probes may 
be chosen accordingly. Alternatively, probes to a variety of mutants may be 
employed sequentially or in combination. Because most individuals carrying 
15 mutations in the PS-interacting proteins will be heterozygous, probes to normal 
sequences also may be employed and homozygous normal individuals may be 
distinguished from mutant heterozygotes by the amount of binding (e.g., by intensity 
of radioactive signal). In another variation, competitive binding assays may be 
employed in which both normal and mutant probes are used but only one is labeled. 
20 (3) Restriction Maopinp 

Sequence alterations may also create or destroy fortuitous restriction enzyme 
recognition sites which are revealed by the use of appropriate enzyme digestion 
followed by gel-blot hybridization. DNA fragments carrying the site (normal or 
mutant) arc detected by their increase or reduction in size, or by the increase or 
decrease of coiresponding restriction fragment numbers. Such restriction fragment 
length polymorphism analysis (RFLP), or restriction mapping, may be employed with 
genomic DNA, mRNA or cDNA. The PS-interacting protein sequences may be 
amplified by PCR using the above-described primers prior to restriction, in which 
case the lengths of the PCR products may indicate the presence or absence of 
30 particular restriction sites, and/or may be subjected to restriction after amplification. 
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The restriction fragments may be visualized by any convenient means (e.g., under UV 
light in the presence of ethidium bromide). 
(4) PGR Mapp int^ 

In another series of embodiments, a single base substitution mutation may be 
detected based on differential PGR product length or production in PGR. Thus, 
primers which span mutant sites or which, preferably, have 3' termini at mutation 
sites, may be employed to amplify a sample of genomic DNA. mRNA or cDNA from 
a subject. A mismatch at a mutational site may be expected to alter the ability of the 
normal or mutant primers to promote the polymerase reaction and, thereby, result in 
product profiles which differ between normal subjects and heterozygous and/or 
homozygous mutants. The PGR products of the normal and mutant gene may be 
differentially separated and detected by standard techniques, such as polyacrylamide 
or agarose gel electrophoresis and visualization with labeled probes, ethidium 
bromide or the like. Because of possible non-specific priming or readthrough of 
15 mutation sites, as well as the fact that most carriers of mutant alleles will be 
heterozygous, the power of this technique may be low. 
(5) Electrophoretic Mohility 
Genetic testing based on DNA sequence differences also may be achieved by 
detection of alterations in electrophoretic mobility of DNA, mRNA or cDNA 
fragments in gels. Small sequence deletions and insertions, for example, can be 
visualized by high resolution gel electrophoresis of single or double stranded DNA, or 
as changes in the migration pattern of DNA hcteroduplexes in non-denaturing gel 
electrophoresis. Mutations or polymorphisms in tiie PS-interacting protein genes may 
also be detected by methods which exploit mobility shifts due to single-stranded 
conformational polymorphisms (SSGP) associated with mRNA or single-stranded 
DNA secondary structures. 

(6) Ghemical G leavage of Mismatches 
Mutations in die PS-interacting protein genes may also be detected by 
employing the chemical cleavage of mismatch (CGM) method (see, e.g., Saleeba and 
Cotton, 1 993, and references therein). In this technique, probes (up to ~ 1 kb) may be 
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mixed with a sample of genomic DNA, cDNA or mRNA obtained fix>m a subject. 
The sample and probes are mixed and subjected to conditions which allow for 
heteroduplex formation (if any). Preferably, both the probe and sample nucleic acids 
are double-stranded, or the probe and sample may be PCR amplified together, to 
5 ensure creation of all possible mismatch heteroduplexes. Mismatched T residues are 
reactive to osmium tetroxide and mismatched C residues are reactive to 
hydroxylamine. Because each mismatched A will be accompanied by a mismatched 
T, and each mismatched G will be accompanied by a mismatched C, any nucleotide 
differences between the probe and sample (including small insertions or deletions) 

10 will lead to the formation of at least one reactive heteroduplex. After treatment with 
osmium tetroxide and/or hydroxylamine to modify any mismatch sites, the mixture is 
subjected to chemical cleavage at any modified mismatch sites by, for example, 
reaction with piperidine. The mixture may then be analyzed by standard techniques 
such as gel electrophoresis to detect cleavage products which would indicate 

15 mismatches between the probe and sample. 

(7) Other Methods 
Various other methods of detecting PS-interacting protein mutations, based 
upon the sequences disclosed and otherwise enabled herein, will be apparent to those 
of ordinary skill in the art. Any of these may be employed in accordance with the 

20 present invention. These include, but are not limited to, nuclease protection assays 
(SI or ligase-mediated), Ugated PCR, denaturing gradient gel electrophoresis (DGGE; 
see, e.g., Fischer and Lerman, 1983), restriction endonuclease fingerprinting 
combined with SSCP (REF-SSCP; see, e.g., Liu and Sommer, 1995), and the like. 
D. Other Screens and Diagnostics 

25 In inherited cases, as the primary event, and in non-inherited cases as a 

secondary event due to the disease state, abnormal processing of the presenilins, PS- 
interacting proteins, APP, or proteins reacting with the presenilins, PS-interacting 
proteins, or APP may occur. This can be detected as abnormal phosphorylation, 
glycosylation, glycation amidation or proteolytic cleavage products in body tissues or 

30 fluids (e.g., CSF or blood). 
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Diagnosis also can be made by observation of alterations in transcription, 
translation, and post-translational modification and processing, as well as alterations 
in the intracellular and extracellular trafficking of gene products in the brain and 
peripheral cells. Such changes will include alterations in the amount of messenger 
5 RNA and/or protein, alteration in phosphorylation state, abnormal intracellular 
location/distribution, abnormal extracellular distribution, etc. Such assays will 
include: Northern Blots (e.g., with PS-interacting protein-specific and non-specific 
nucleotide probes), Western blots and enzyme-linked immunosorbent assays (ELISA) 
(e.g.. with antibodies raised specifically to a PS-interacting protein or PS-interacting 
10 functional domain, including various post-translational modification states including 
glycosylated and phosphorylated isoforras). These assays can be perfonned on 
peripheral tissues (e.g., blood cells, plasma, cultured or other fibroblast tissues, etc.) 
as well as on biopsies of CNS tissues obtained antemortem or postmortem, and upon 
cerebrospinal fluid. Such assays might also include in situ hybridization and 
immunohistochemistiy (to locaUze messenger RNA and protein to specific subcellular 
compartments and/or within neuropathological structures associated with these 
diseases such as neurofibrillary tangles and amyloid plaques). 
E. Screening and Diagnostic Kits 

In accordance with the present invention, diagnostic kits are also provided 
which will include the reagents necessary for the above-described diagnostic screens. 
For example, kits may be provided which include antibodies or sets of antibodies 
which are specific to one or more mutant epitopes. These antibodies may. in 
particular, be labeled by any of the standard means which facilitate visualization of 
binding. Alternatively, kits may be provided in which oligonucleotide probes or PGR 
25 primers, as described above, are present for the detecUon and/or amplification of 
normal or mutant presenilin and/or PS-interacting protein nucleotide sequences. 
Again, such probes may be labeled for easier detection of specific hybridization. As 
appropriate to the various diagnostic embodiments described above, the 
oligonucleotide probes or antibodies in such kits may be immobilized to substrates 
and ^propriate controls may be provided. 



20 



30 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



-74- 



11. Methods of Treatment 

The present invention now provides a basis for therapeutic intervention in 
diseases which are caused, or which may be caused, by mutations in the PS- 
5 interacting proteins. As noted above, mutations in the hPSl and hPS2 genes have 
been associated with the development of early onset fonns of Alzheimer's Disease 
and, therefore, the present invention is particularly directed to the treatment of 
subjects diagnosed with, or at risk of developing, Alzheimer's Disease. 

Without being bound to any particular theory of the invention, the effect of 

10 the Alzheimer's Disease related mutations in the presenihns appears to be a gain of a 
novel function, or an acceleration of a nomial function, which directly or indirectly 
causes aberrant processing of the Amyloid Precursor Protein (APP) into AP peptide, 
abnormal phosphorylation homeostasis, and/or abnomnal apoptosis in the brain. Such 
a gain of function or acceleration of function model would be consistent with the adult 

15 onset of the symptoms and the dominant inheritance of Alzheimer's Disease. 

Nonetheless, the mechanism by which mutations in the presenilins may cause these 
effects remains unknown. 

The present invention, by identifying a set of PS-interacting proteins, 
provides new therapeutic targets for intervening in the etiology of presenilin-related 

20 AD. In addition, as mutations in the presenilins may cause AD, it is likely that 
mutations in the PS-interacting proteins may also cause AD. The fact that the PS- 
interacting protein SSa is alternately processed in the brains of victims of sporadic 
AD, as well as in the brains of victims of presenilin-linked AD, suggests that, at the 
very least, this PS-interacting protein is involved in the etiology of AD independent of 

25 mutations in the presenilins. It is likely that the other PS-interacting proteins also 
may be involved in non-presenilin-linked AD. 

Therapies to treat PS-interacting protein-associated diseases such as AD 
may be based upon (1) administration of normal PS-interactmg proteins, (2) gene 
therapy with normal PS-interacting protein genes to compensate for or replace the 

30 mutant genes, (3) gene therapy based upon antisense sequences to mutant PS- 
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interacting protein genes or which "knock-out" the mutant genes. (4) gene therapy 
based upon sequences which encode a protein which blocks or corrects the deleterious 
effects of PS-interacting protein mutants. (5) immunotherapy based upon antibodies to 
noimal and/or mutant PS-interacting proteins, or (6) small molecules (drugs) which 
alter PS-interacting protein expression, alter interactions between PS-interacting 
proteins and other proteins or ligands, or which otherwise block the aberrant function 
of mutant presenilin or PS-interacting proteins by altering the structure of the mutant 
proteins, by enhancing their metabolic clearance, or by inhibiting their function. 
A. Protein Therapy 

Treatment of Alzheimer's Disease, or other disorders resulting from PS- 
interacting protein mutations, may be performed by replacing the mutant protein with 
normal protein, by modulating the function of the mutant protein, or by providing an 
excess of normal protein to reduce the effect of any aberrant function of the mutant 
proteins. 

accomplish this, it is necessary to obtain, as described and enabled 
herein, large amounts of substantially pure PS-interacting protein from cultured cell 
systems which can express the protein. Delivery of the protein to the affected brain 
areas or other tissues can then be accomplished using appropriate packaging or 
administration systems including, for example, liposome mediated protein delivery to 
20 the target cells. 

B. Gene Therapy 

In one series of embodiments, gene therapy may be employed in which 
normal copies of a PS-interacting protein gene are introduced into patients to code 
successfully for normal protein in one or more different affected cell types. The gene 
must be delivered to those cells in a fomi in which it can be taken up and code for 
sufficient protein to provide effective function. Thus, it is preferred that the 
recombinant gene be operably joined to a strong promoter so as to provide a high 
level of expression which will compensate for, or out-compete, the mutant proteins. 
As noted above, the recombinant construct may contain endogenous or exogenous 
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regulatory elements, inducible or repressible regulatory elements, or tissue-specific 
regulatory elements. 

In another series of embodiments, gene therapy may be employed to 
replace the mutant gene by homologous recombination with a recombinant construct. 
5 The recombinant construct may contain a normal copy of the targeted PS-interacting 
protein gene, in which case the defect is corrected in situ , or may contain a "knock- 
out** construct which introduces a stop codon, missense mutation, or deletion which 
abolished function of the mutant gene. It should be noted in this respect that such a 
construct may knock-out both the normal and mutant copies of the targeted gene in a 

10 heterozygous individual, but the total loss of gene function may be less deleterious to 
the individual than continued progression of the disease state. 

In another series of embodiments, antisense gene therapy may be 
employed. The antisense ther^y is based on the fact that sequence-specific 
suppression of gene expression can be achieved by intracellular hybridization between 

15 mRNA or DNA and a complementary antisense species. The formation of a hybrid 
duplex may then interfere with the transcription of the gene and/or the processing, 
transport, translation and/or stability of the target mRNA. Antisense strategies may 
use a variety of approaches including the administration of antisense oligonucleotides 
or antisense oligonucleotide analogs (e.g., analogs with phosphorothioate backbones) 

20 or transfection with antisense RNA expression vectors. Again, such vectors may 
include exogenous or endogenous regulatory regions, inducible or repressible 
regulatory elements, or tissue-specific regulatory elements. 

In another series of embodiments, gene therapy may be used to introduce a 
recombinant construct encoding a protein or peptide which blocks or otherwise 

25 corrects the aberrant function caused by a mutant presenilin or PS-interacting protein 
gene. In one embodiment, the recombinant gene may encode a peptide which 
corresponds to a domain of a PS-interacting which has been found to abnormally 
interact with another cell protein or other cell ligand (e.g., a mutant presenilin). Thus, 
for example, if a mutant PSl TM6-»7 domain is found to interact with a PS- 

30 interacting protein but the corresponding normal TM6^7 domain does not undergo 
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this interaction, gene therapy may be employed to provide an excess of the mutant 
TM6-».7 domain which may compete with the mutant presenilin protein and inhibit or 
block the aberrant interaction. Alternatively, the PS-interacting domain of a PS- 
interacting protein which interacts with a mutant, but not a nomal, presenilin may be 
5 encoded and expressed by a recombinant construct in order to compete with, and 
thereby inhibit or block, the aberrant interaction. 

Retroviral vectors can be used for somatic cell gene therapy especially 
because of their high efficiency of infection and stable integration and expression. A 
full length PS-interacting protein gene, subsequences encoding functional domains of 

10 these proteins, or any of the other therapeutic peptides described above, can be cloned 
into a retroviral vector and expression may be driven from its endogenous promoter, 
from the retroviral long terminal repeat, or from a promoter specific for the target cell 
type of interest (e.g., neurons). Other viral vectors which can be used include adeno- 
associated virus, vaccinia virus, bovine papilloma virus, or a herpes virus such as 

15 Epstein-Barr virus. 

C. Immunotherapy 

Immunotherapy is also possible for Alzheimer's Disease. Antibodies may 
be raised to a normal or mutant PS-interacting protein (or a portion thereof) and are 
administered to the patient to bind or block an aberrant interaction (e.g., with a mutant 
presenilin) and prevent its deleterious effects. Simultaneously, expression of the 
normal protein product could be encouraged. Altematively, antibodies may be raised 
to specific complexes between mutant or wild-type PS-interacting proteins and their 
interaction partners. 

A further approach is to stimulate endogenous antibody production to the 
25 desired antigen. Administration could be in the form of a one time immunogenic 

preparation or vaccine immunization. The PS-interacting protein or other antigen may 
be mixed with pharmaceutically acceptable carriers or excipients compatible with the 
protein. The immunogenic composition and vaccine may further contain auxiliary 
substances such as emulsifying agents or adjuvants to enhance effectiveness. 
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Immunogenic compositions and vaccines may be administered parenteraliy by 
injection subcutaneously or intramuscularly. 

D. Small Molecule Therapeutics 

As described and enabled herein, the present invention provides for a 
5 nimiber of methods of identifying small molecules or other compounds which may be 
useful in the treatment of Alzheimer's Disease or other disorders caused by mutations 
in the presenilins or PS-interacting proteins. Thus, for example, the present invention 
provides for methods of identifying proteins which bind to normal or mutant PS- 
interacting proteins (aside fi-om the presenilins). The invention also provides for 
10 ■ methods of identifying small molecules which can be used to disrupt aberrant 

interactions between mutant presenilins and/or PS-interacting proteins and such other 
binding proteins or other cell components. 



Examples 

15 Example 1 . Isolation of PS-interacting proteins bv two-hvbrid veast system. 

To identify proteins interacting with the presenilin proteins, a 
commercially available yeast two-hybrid kit ("Matchmaker System 2" from Clontech, 
Palo Alto, C A) was employed to screen a brain cDNA library for clones which 
interact with functional domains of the presenilins. In view of the likelihood that the 

20 TM6->7 loop domains of the presenilins are important functional domains, partial 
cDNA sequences encoding either residues 266-409 of the normal PSl protein or 
residues 272-390 of the normal PS2 protein were ligated in-frame into the EcoRI and 
BamHI sites of the pAS2-l fusion-protein expression vector (Clontech). The resultant 
fusion proteins contain the GAL4 DNA binding domain coupled in-frame either to the 

25 TM6->7 loop of the PSl protein or to the TM6^7 loop of the PS2 protein. These 
expression plasmids were co-transfomied into S. cerevisiae strain Y190 together with 
a library of human brain cDNAs ligated into the pACT2 yeast fusion-protein 
expression vector (Clontech) bearing the GAL4 activation domain using modified 
lithium acetate protocols of the "Matchmaker System 2" yeast two-hybrid kit 

30 (Clontech, Palo Alto, CA). Yeast clones bearing human brain cDNAs which interact 
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with the TM6-).7 loop domain were selected for His- resistance by plating on SD 
minimal medium lacking histidine and for pgaH- activation by color selection. The 
His+ Pgal+ clones were then purged of the pAS2-l "bait" construct by culture in 
lOtig/ml cyclohexamide and the unknown "trapped" inserts of the human brain 
5 cDNAs encoding PS-interacting proteins were isolated by PCR and sequenced. Of 6 
million initial transformants, 200 positive clones were obtained after His- selection, 
and 42 after pgal+ color selection, carried out in accordance with the manufacturer's 
protocol for selection of positive colonies. Of these 42 clones there were several 
independent clones representing the same genes. 

To address the likelihood that mutations in the presenilins cause AD 
through the acquisition of a novel but toxic function (i.e., dominant gain of function 
mutation) which is mediated by a novel interaction between the mutant proteins and 
one or more other cellular proteins, the human brain cDNA library cloned into the 
pACT2 expression vector (Clontech) was re-screened using mutant TM6->7 loop 
15 domain sequences as described above and according to manufacturer's protocols. In 
particular, mutant prcsenilin sequences corresponding to residues 260-409 of PSl 
TM6-^7 loop domains bearing mutations L286V, L392V and A290-319 were ligated 
in-frame into the GAL4 DNA-binding domain of the pAS2-l vector (Clontech) and 
used to screen the human brain cDNA:GAL4 activation domain library of pACT 
vectors (Clontech). Yeast were co-transfonmed, positive colonies were selected, and 
"trapped" sequences were recovered and sequenced as described above. In addition to 
some of the same sequences recovered with the nonnal TM6->7 loop domains, 
several new sequences were obtained which reflect aberrant interactions of the mutant 
presenilins with normal cellular proteins. 

The recovered and sequenced clones corresponding to these PS-interacting 
proteins were compared to the public sequence databases using the BL:ASTN 
algorithm via the NCBI e-mail server. Descriptions of several of these clones follow: 

Antisecretorv Factor/ PrntM some SSa Suhimi t Two overlapping clones 
(Y2H29 and Y2H31) were identified which correspond to a C-teiminal fragment of a 
protein alternatively identified as Antisecretory Factor ("ASF") or the Multiubiquitin 
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chain binding S5a subunit of the 26S proteasome ("S5a") (Johansson et al. 1995; 
Ferrell et al., 1996). The complete nucleotide and amino acid sequences of the S5a 
subunit are available through the public databases under Accession number US 1007 
and are reproduced here as SEQ ID NO: 1 and SEQ ID NO: 2. The nucleotide 
5 sequences of the Y2H29 and Y2H31 clones include nucleotides 351-1330 of SEQ ID 
NO: 1 and amino acid residues 70-377 of SEQ ID NO: 2. Thus, residues 70-377 of 
the full S5a subunit include the PS-interacting domain of this protein. Residues 206- 
377 of S5a contain certain motifs that are important for protein-protein interactions 
(Ferrell etaL, 1996). 

10 The PS 1 -S5a subunit interaction was directly re-tested for both wild type 

and mutant PSl TM6->7 loop (residues 260-409) by transforming Y187 yeast cells 
with the appropriate wild type or mutant (L286V, L392V or A290-319) cDNA ligated 
in-frame to the GAL4-DNA binding domain of pACT2. The A290-319 mutant fusion 
construct displayed autonomous Pgal activation in the absence of any S5a "target 

15 sequence" and, therefore, could not be further analyzed. In contrast, both the L286V 
and L392V mutant constructs interacted specifically with the S5a construct. 
Quantitative assays, however, showed that these interactions were weaker than those 
involving the wild type PSIj^moq sequence and that the degree of interaction was 
crudely correlated with the age of onset of FAD. The difference in Pgal activation 

20 was not attributable to instability of the mutant PSl construct mRNAs or fusion 
proteins because Western blots of lysates of transformed yeast showed equivalent 
quantities of mutant or wild-type fusion proteins. 

Because one of the putative functions of S5a is to bind multi-ubiquitinated 
proteins, the PSl :S5a interaction observed in S. cerevisiae could arise either through 

25 yeast-dependent ubiquitination of the PS 1 2^0^ construct, or by direct interaction. The 
former would reflect a degradative pathway, a functional and perhaps reciprocal 
interaction between PSl and S5a, or both. A direct interaction is favored by the fact 
that the PSl:S5a interaction is decreased rather than increased by the presence of the 
L286V and L392V mutations, and by the fact that neither of these mutations affect 

30 ubiquitin conjugation sites in the PSli^o^ loop K265, K311, K314 or K395). 
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To further examine this possibility, we investigated the direct interaction of 
recombinant His-tagged fiision proteins corresponding to full length S5a and the 
PSl2«M09 loop. Partially purified recombinant His-tagged PSlj^^ loop and His- 
tagged S5a proteins and ^propriate controls were mixed in phosphate buffered saline. 
5 The mixture was then subjected to size exclusion chiomatogr^hy, and eluates were 
examined by SDS-PAGE and Western blotting using anti-His-tag monoclonal 
antibodies (Quiagen). In the crude PSl^^^ loop preparation alone, the ?Sl,^ loop 
eluted from the size exclusion column as a broad peak at 35 minutes. In the crude S5a 
preparation alone, S5a eluted at 25 minutes. However, when the crude PS l^^^ loop 
10 and S5a preparations were mixed, there was a significant shift in the elution of PS 1 
409 toward a higher molecular weight complex. Co-elution of S5a and PSlj^o-w in the 
same fraction was confirmed by SDS-PAGE and Western blotting of fractions using 
the anti-His-tag antibody. These results are consistent with a ubiquitin-independent 
and, therefore, possibly functional interaction. 

GT24 and related genes with homolnp^ y to pl20/plakoglobin faTnily Five 
over-lapping clones (Y2H6, Y2H10b, Y2H17h2, Y2H24, and Y2H25) were obtained 
which interact with the normal PSI TM6-^7 loop domain and which appear to 
represent at least one novel gene. The Y2H24 clone was also found to interact with 
the mutant PSI TM6->7 loop domains. Note that it appears that more than one 
20 member of the gene family was isolated, suggesting a family of genes interacting 
differentially with different preseniUns. The most complete available cDNA 
corresponding to these clones was designated GT24 and is disclosed herein as SEQ ID 
NO: 3 and has been deposited with GenBank as Accession number U8I004. The 
open reading frame suggests that GT24 is a protein of at least 1040 amino acids with a 
unique N-terminus, and considerable homology to several armadillo (ann) repeat 
proteins at its C-terminus. The predicted amino acid sequence of GT24 is disclosed 
herein as SEQ ID NO: 4. Thus, for example, residues 440-862 of GT24 have 32-56% 
identity (p=1.2e'") to residues 440-854 of murine pl20 protein (Accession number 
Z17804), and residues 367-815 of GT24 have 26^2% identity (p=0.0017) to residues 
245-465 of the D. melanogaster armadillo segment polarity protein (Accession 
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number PI 8824). The GT24 gene maps to chromosome 5pl5 near the anonymous 
microsatellite marker D5S748 and the Cri-du-Chat syndrome locus. This sequence is 
also nearly identical to portions of two human ESTs of unknown function (i.e., 
nucleotides 2701-3018 of Accession number F08730 and nucleotides 2974-3348 of 
5 Accession number Tl 8858). These clones also show lower degrees of homology with 
other partial cDNA and gDNA sequences (e.g., H17245, T06654, T77214, H24294, 
M62015, T87427 and G04019). 

p0071 gene . An additional His", Pgal* clone isolated in the initial screening 
with wild type PS 1266^ "bait" had a similar nucleotide sequence to GT24 (target 

10 clone Y2H25; Accession number U81005). and would also be predicted to encode a 
peptide with C-terminal mn repeats. A longer cDNA sequence closely corresponding 
to the Y2H25 clone has been deposited in GenBank as human protein p0071 
(Accession number X81889). The nucleotide and corresponding amino acid 
sequences of p0071 are reproduced herein as SEQ ID NOs: 5 and 6. Comparison of 

15 the predicted sequence of the p0071 ORF with that of GT24 confirms that they are 
related proteins with 47% overall amino acid sequence identity, and with 70% identity 
between residues 346-862 of GT24, and residues 509-1022 of p0071 (which includes 
residues encoded by the Y2H25 cDNA). The latter result strongly suggests that PSl 
interacts with a novel class of arm repeat containing proteins. The broad - 4 kb 

20 hybridization signal obtained on Northern blots with the unique 5* end of GT24 could 
reflect either alternate splicing/polyadenylation of GT24, or the existence of 
additional members of this family with higher degrees of N-terminal homology to 
GT24thanp0071. 

Rabl 1 gene. This clone (Y2H9), disclosed herein as SEQ ID NO: 7, was 
25 identified as interacting with the normal PS 1 TM6^7 loop domain and appears to 
correspond to a known gene, Rabl 1, available through Accession numbers X56740 
and X53143. Rabl 1 is believed to be involved in protein/vesicle trafficking in the 
ER/Golgi. Note the possible relationship to processing of membrane proteins such as 
BAPP and Notch with resultant overproduction of toxic AB peptides (especially 
30 neurotoxic AB,^(43) isoforms) (Scheuner, et al, 1995). 
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Retinoid X recentor-fl genfi This clone (Y2H23b), disclosed herein as 
SEQ ID NO: 8, was identified as interacting with the normal PSl TM6^7 loop 
domain and appears to correspond to a known gene, known variously as the retinoid X 
recqjtor-p, nuclear receptor co-regulator or MHC Class I regulatory element, and 
5 available through Accession numbers M84820, X63522 and M81766. This gene is 
believed to be involved in intercellular signaling, suggesting a possible relationship to 
the intercellular signaling function mediated by C. eleeans sell 2 and Notch/lin-12 
(transcription activator). 

Cytoplasmic chaoeronin gene This clone (Y2H27). disclosed herein as 
10 SEQ ED NO: 9, was identified as interacting with the normal PS 1 TM6->7 loop 
domain and appears to correspond to a known gene, a cytoplasmic chaperonin 
containing TCP-1, available through Accession numbers Ul 7 104 and X74801. 

Unknown gene (Y2ms\ This clone (Y2H35), disclosed herein as SEQ 
m NO: 10, was identified as interacting with the normal PSl TM6->7 loop domain 
15 and appears to correspond to a known gene of unknown function, available through 
Accession number R12984, which shows conservation down through yeast. 

Unknown gene (Y2H1 71). This clone (Y2H171), disclosed herein as SEQ 
ID NO: 1 1, was idenUfied as interacting with the normal PSl TM6-^7 loop domain 
and appears to correspond to a known expressed repeat sequence available through 
20 Accession number D55326. 

UnknowT^RenefY2H41). This clone (Y2H41) was identified which reacts 
strongly with the TM6->7 loop domains of both PSl and PS2 as well as the mutant 
loop domains of PSl . The sequence, disclosed as SEQ ID NO: 12, shows strong 
homology to an EST of imknown function (Accession number T64843), 
Example 2. Isolation of oresenilin b inding nroteins by affinitv chromatography 

To identify the proteins which may be involved in the biochemical 
function of the presenilins, PS-interacting proteins were isolated using affinity 
chromatography. A GST-fiision protein containing the PSl TM6->7 loop, prepared 
as described in Example 3, was used to probe human brain extracts, prepared by 
homogenizing brain tissue by Polytron in physiological salt solution. Non-specific 
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binding was eliminated by pre-clearing the brain homogenates of endogenous GST- 
binding components by incubation with glutathione-Sepharose beads. These GST- 
free homogenates were then incubated with the GST-PS fusion proteins to produce the 
desired complexes with functional binding proteins. These complexes were then 
5 recovered using the affinity glutathione-Sepharose beads. After extensive washing 
with phosphate buffered saline, the isolated collection of proteins was separated by 
SDS-polyacrylamide gel electrophoresis (SDS-PAGE; Tris-tricine gradient gel 4- 
20%). Two major bands were observed at -14 and 20 kD in addition to several 
weaker bands ranging from SO to 60 kD. 
10 The same approach may now be used to identify proteins which have 

binding activity for the PS-interacting proteins and, thereby, to fiirther elucidate the 
etiology of AD and to identify additional therapeutics targets for intervention in AD 
and related disorders. 

Example 3. Eukarvotic and prokarvotic expression vector systems. 

15 Constructs suitable for use in eukaryotic and prokaryotic expression 

systems have been generated using different classes of PSl nucleotide cDNA 
sequence inserts. In the first class, termed full-length constructs, the entire PS 1 cDNA 
sequence is inserted into the expression plasmid in the correct orientation, and 
includes both the natural 5* UTR and 3' UTR sequences as well as the entire open 

20 reading frame. The open reading frames bear a nucleotide sequence cassette which 
allows either the wild type open reading frame to be included in the expression system 
or alternatively, single or a combination of double mutations can be inserted into the 
open reading frame. This was accomplished by removing a restriction fragment from 
the wild type open reading frame using the enzymes Narl and Pflml and replacing it 

25 with a similar fragment generated by reverse transcriptase PGR and bearing the 

nucleotide sequence encoding either the M146L mutation or the H163R mutation. A 
second restriction fragment was removed from the wild type normal nucleotide 
sequence for the open reading frame by cleavage with the enzymes Pflml and Ncol 
and replaced with a restriction fragment bearing the nucleotide sequence encoding the 

30 A246E mutation, the A260V mutation, the A285 V mutation, the L286V mutation, the 
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L392V mutation or the C410Y mutation. A third variant, bearing a combination of 
either the M146L or H163R mutation in tandem with one of the remaining mutations, 
was made by linking a Narl-PflmI fragment bearing one of the former mutations and a 
Pflml-Ncol fragment bearing one of the latter mutations. 

The second class of cDNA inserts, termed truncated constructs, was 
constructed by removing the 5' UTR and part of the 3' UTR sequences from full 
length wild type or mutant cDNA sequences. The 5' UTR sequence was replaced with 
a synthetic oligonucleotide containing a Kpnl restriction site (GGTAC/C) and a small 
sequence (GCCACC) to create a Kozak initiation site around the ATG at the 
begimiing of the PSl ORF. The 3' UTR was replaced with an oligonucleotide with an 
artificial EcoRI site at the 5' end. Mutant variants of this construct were then made by 
inserting the mutant sequences described above at the Narl-Pfhnl and Pslml-Ncol 
sites as described above. 

For eukaryotic expression, these various cDNA constracts bearing wild 
type and mutant sequences, as described above, were cloned into the expression 
vector pZeoSV in which the SV60 promoter cassette had been removed by restriction 
digestion and replaced with the CMV promoter element of pcDNA3 (Invitrogen). For 
piokaiyotic expression, constnicts have been made using the glutathione S-transferase 
(GST) fusion vector pGEX-kg. The inserts which have been attached to the GST 
fiision nucleotide sequence are the same nucleotide sequences described above 
bearing either the normal open reading frame nucleotide sequence, or bearing a 
combination of single and double mutations as described above. These GST fusion 
constructs allow expression of the partial or full-length protein in prokaryotic cell 
systems as mutant or wild type GST fusion proteins, thus allowing purification of the 
full-length protein followed by removal of the GST fusion product by thrombin 
digestion. A fiirther cDNA construct was made with the GST fusion vector, to allow 
the production of the amino acid sequence corresponding to the hydrophilic acidic 
loop domain between TM6 and TM7 of the full-length protein, either as a wild type 
nucleotide sequence or as a mutant sequence bearing either the A285 V mutation, the 
L286V mutation or the L392V mutation. This was accomplished by recovering wild 
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type or mutant sequence from appropriate sources of RNA using a 5' oligonucleotide 
primer with a 5' BamHI restriction site (G/GATCC), and a 3* primer with a 5* EcoRI 
restriction site (G/AATTC). This allowed cloning of the appropriate mutant or wild 
type nucleotide sequence corresponding to the hydrophilic acidic loop domain at the 
5 BamHI and the EcoRI sites within the pGEX-KG vector. 

The PS-interacting protein genes may be similarly manipulated by 
recombinant means for expression in prokaryotic or eukaryotic hosts. In particular, 
GST or other fusion proteins may be produced which will be useful in assays (e.g., 
yeast two-hybrid studies) for therapeutics. 

10 Example 4. Antibody production. 

Peptide antigens corresponding to portions of the PSl protein were 
synthesized by solid-phase techniques and purified by reverse phase high pressure 
liquid chromatography. Peptides were covalcntly linked to keyhole limpet 
hemocyanin (KLH) via disulfide linkages that were made possible by the addition of a 

15 cysteine residue at the peptide C-terminus of the presenilin fragment. This additional 
residue does not ^ppc^r normally in the protein sequence and was included only to 
facilitate linkage to the KLH molecule. 

A total of three New Zealand white rabbits were immunized with peptide- 
KLH complexes for each peptide antigen in combination with Freund's adjuvant and 

20 were subsequently given booster injections at seven day intervals. Antisera were 
collected for each peptide and pooled and IgG precipitated with ammonium sulfate. 
Antibodies were then affinity purified with Sulfo-Iink agarose (Pierce) coupled with 
the appropriate peptide. This final purification is required to remove non-specific 
interactions of other antibodies present in either the pre- or post-immune serum. 

25 The specificity of each antibody was confirmed by three tests. First, each 

detected single predominant bands of the approximate size predicted for presenilin- 1 
on Western blots of brain homogenate. Second, each cross-reacted with recombinant 
fusion proteins bearing the appropriate sequence. Third each could be specifically 
blocked by pre-absorption with recombinant PSl or the immunizing peptide. 
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Antibodies to peptides derived from the PS-interacting proteins may be 
produced by similar means. 
Examples. Transgenic mice. 

A series of wild type and mutant PSl and PS2 genes were constructed for 
use in the preparation of transgenic mice. Mutant versions of PS I and PS2 were 
generated by site-directed mutagenesis of the cloned cDNAs using standard 
techniques. 

The cDNAs and their mutant versions were used to prepare two classes of 
mutant and wild type PSl and PS2 cDNAs, as described in Example 3. The first 
class, referred to as "full-length" cDNAs, were prepared by removing approximately 
200 bp of the 3' untranslated region immediately before the polyA site by digestion 
with EcoRI (PSl) or PvuII (PS2). The second class, referred to as "truncated" 
cDNAs, were prepared by replacing the 5' untranslated region with a ribosome 
binding site (Kozak consensus sequence) placed immediately 5' of the ATG start 
15 codon. 

Various fill! length and truncated wild type and mutant PSl and PS2 
cDNAs, prepared as described above, were introduced into one or more of the 
following vectors and the resulting constructs were used as a source of gene for the 
production of transgenic mice. 

The cos.TET expression ver^nr- This vector was derived from a cosmid 
clone containing the Syrian hamster PrP gene. It has been described in detail by Scott 
et al. (1992) and Hsiao et al. (1995). PSl and PS2 cDNAs (full length or truncated) 
were inserted into this vector at its Sail site. The final constructs contain 20 kb of 5' 
sequence flanking the inserted cDNA. This 5' flanking sequence includes the PrP 
gene promoter. 50 bp of a PrP gene 5' untranslated region exon, a splice donor site, a 1 
kb intron, and a splice acceptor site located immediately adjacent to the Sail site into 
which the PSl or PS2 cDNA was inserted. The 3' sequence flanking the inserted 
cDNA includes an approximately 8 kb segment of PrP 3' untranslated region 
including a polyadenylation signal. Digestion of this constmct with NotI (PSl) or 
Fsel (PS2) released a fragment containing a mutant or wild type PS gene under the 
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control of the PrP promoter. The released fragment was gel purified and injected into 
the pronuclei of fertilized mouse eggs using the method of Hsiao et al. (1995). 

Platelet-derived growth factor receptor 6-subunit constructs : PScDNAs 
were also introduced between the Sail (full length PSl cDNAs) or Hindin (truncated 
5 PSl cDNAs, full length PS2 cDNAs. and truncated PS2 cDNAs) at the 3' end of the 
human platelet derived growth factor receptor P-subunit promoter and the EcoRI site 
at the 5' end of the SV40 polyA sequence and the entire cassette was cloned into the 
pZeoSV vector (Invitrogen, San Diego, CA.). Fragments released by Scal/BamHI 
digestion were gel purified and injected into the pronuclei of fertilized mouse eggs 

10 using the method of Hsiao et al. ( 1 995); 

Human fi-actin constructs : PSl and PS2 cDNAs were inserted into the Sail 
site of pBAcGH. The construct produced by this insertion includes 3.4 kb of the 
human p actin 5* flanking sequence (the human p actin promoter, a spliced 78 bp 
human p actin 5* imtranslated exon and intron) and the PSl or PS2 insert followed by 

15 2.2 kb of human growth hormone genomic sequence containing several introns and 
exons as well as a polyadenylation signal. Sfil was used to release a PS-containing 
fragment which was gel purified and injected into the pronuclei of fertilized mouse 
eggs using the method of Hsiao et al. (1995). 

Phosphoelvcerate kinase constructs : PSl and PS2 cDNAs were introduced 

20 into the pkJ90 vector. The cDNAs were inserted between the Kpnl site downstream 
of the human phosphoglycerate kinase promoter and the Xbal site upstream of the 3* 
untranslated region of the human phosphoglycerate kinase gene. PvuII/Hindlll (PS I 
cDNAs) or PvuII (PS2 cDNAs) digestion was used to release a PS-containing 
fragment which was then gel purified and injected into the pronuclei of fertilized 

25 mouse eggs as described above. 

Analysis of AB in transgenic murine hippocampus : To analyze the effect 
of a mutant himian PSl transgene in mice, a PSl mutation observed in conjunction 
with a particularly severe form of early-onset PSl-linked Alzheimer's disease was 
used, namely the M146L missense mutation (Sherrington et al., 1995). The animals, 

30 which were heterozygous for the PS 1 mutant transgene on a mixed FVB-C57BL/6 
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strain background, were cross-bred with similar mice bearing the human wild-type 
PAPP„5 cDNA under the same Syrian hamster PrP promoter similar to those animals 
recently described by Hsiao et al., 1995. These cross breedings were done because it 
is thought that human Ap is more susceptible to the formation of aggregates than are 
5 murine A|3 peptides. 

The progeny of these PS1m,46l x pAPP^ cross-breedings were then 
genotyped to identify animals that contained both the human wild-type pAPP^,, 
transgene and also the mutant human PS\^,^ transgene. These mice were aged umil 
two to three months of age and then sacrificed, with the hippocampus and neocortex 
10 being dissected rapidly fi-om the brain and frozen. Litter mates of these mice, which 
contained only the wild-type human PAPP.,, transgene were also sacrificed, and their 
hippocampi and neocortices were dissected and r^idly frozen as well. 

The concentration of both total AP peptides (Ap^^, and Ap^^,,,,) as well 
as the subset of Ap peptides ending on residues 42 or 43 (long-tailed Ap^j peptides) 
15 were then measured using a two-sandwich ELISA as described previously (Tamaoka 
et al., 1994; Suzuki et al., 1994). These results convincingly showed a small increase 
in total AP peptides in the double transgenic animals bearing wild-type human 
PAPPj,5 and mutant human PSIm^^l transgenes compared to the wild-type human 
PAPPwj controls. More impressively, these measurements also showed that there was 
20 an increase in the amount of long-tailed Ap peptides ending on residues 42 or 43 
(Ap«). In contrast. litter mates bearing only the wild-type human PAPP^, transgene 
had Ap« long-tailed peptide values which were below the limit of quantitation 
("BLQ"). 

These observations therefore confmn that the construction of transgenic 
25 animals can recapitulate some of the biochemical features of human Alzheimer's 
disease (namely the overproduction of Ap peptide and. in particular, overproduction 
of long-tailed isoforms of Ap peptide). These observations thus prove that the 
transgenic models are in fact useful in exploring therapeutic targets relevant to the 
treatment and prevention of Alzheimer's disease. 
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Analysis of hippocampus dependent memory functions in PSl transgenic 
mice: Fourteen transgenic C57BL/6 x FVB mice bearing the human PS1mm6v mutant 
transgene under the PrP promoter (as described) above and 12 wild type litter mates 
aged 2.5-3 months of age (both groups were balanced for age, weight, and sex) were 
5 investigated for behavioral differences attributable to the mutant transgene. Also the 
qualitative observation of murine behavior in their home cages did not indicate 
bimodal distribution of behaviors in the sample of animals. 

Experiment 1 . To test for subtle differences in exploratory 
behavior (e.g. locomotion, scanning of the environment through rearing, and patterns 

10 of investigation of unfamiliar environment), both PS 1 m\46v and wild type litter mates 
were tested in the open-field (Janus, ct al. 1995). The results of the test revealed no 
significant differences between transgenics and controls in exploration of a new 
environment measured by mice locomotor behaviors (walking, pausing, wall leaning, 
rearing, grooming), (F(1^4) = .98, NS). Thus, differences any in behavior on the 

15 Monris water maze test (see below) cannot be attributed to differences in locomotor 
abilities, etc. 

Experiment 2 . One week after the open-field test, the PS 1mi46v 
mutant transgenic mice and their litter mates were trained in the Morris water maze. 
In this test, a mouse has to swim in a pool in order to find a submerged escape 

20 platform. The animal solves that test through learning the location of the platform 
using the available extra-maze spatial cues (Morris, 1990). This test was chosen 
because there is strong evidence that the hippocampal fonnation is involved in this 
form of learning. The hippocampus is also a major site of AD neuropathology in 
humans and defects in spatial learning (geographic disorientation, losing objects, 

25 wandering, etc.) are prominent early features of human AD. As a result the test is 
likely to detect early changes equivalent to those seen in human AD. The Morris test 
is conducted in three phases. In the first phase (the learning acquisition phase), the 
mouse has to learn the spatial position of the platform. In the second phase (the probe 
trial), the platform is removed fi-om the pool and the mouse's search for the platform 

30 is recorded. In the final phase (the learning transfer phase), the platform is replaced in 
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a new position in the pool, and the mouse has to learn that new spatial position of the 
platform. 

Transgenic and wild type mice did not differ in their latencies to find the 
platform during learning acquisition (F(1.24) = 0.81. NS), and both groups showed 
rapid learning across trials (F(10,15) = 1 1.57. p < 0.001). During the probe trial 
phase, mice from both groups searched the quadrant of the pool which originally 
contained the platform significantly longer than other areas of the pool which had not 
contained the platform (F(3.22) = 28.9. p < 0.001). However, the wild type controls 
showed a trend which was not quite statistically significant (t(24) = 1.21, p = 0.24) for 
an increased number of crossings of the exact previous position of the platfoim. In 
the leaiTiing transfer test, both groups showed the same latency of finding the new 
position of the platfonn in the initial block of trials (t(24) =1.11, NS). Such long 
latency to find the new spatial position is expected because the mice spent most of 
their time searching for the platform in the old spatial position. However, in later 
trials in the learning transfer phase, the wild type mice showed shorter swim latencies 
to the new position of the platform compared to the PS1m„*v mutant transgenics 
(F(1.24) 2.36, p = 0.14). The results indicate that PS1m,«v mutant transgenic mice 
were less flexible in transferring learned information to a new situation and tended to 
persevere in their search for the platfomi in the old location. 

Thus, although no differences were found in the spontaneous exploration 
of a new environment and in the acquisition of new spatial information between the 
wild type and the PSl„,^v mutant transgenic mice, the ?Sl^^,^ mutant transgenic 
mice were impaired in switching and/or adapting this knowledge in later situations. 

Electrgphysiolopica l Recordinps in the hiDDocamp ... of mutant tn.n.p ^in 
mice: Five to six months old litter mate control and human PSl^.^v mutant 
transgenic mice on the same C57BL/6 x FVB strain backgrounds as above were used 
to study long term potentiation (LTP) as an electrophysiologic coirelate of learning 
and memory in the hippocampus. Recordings were carried out on 400 ^m thick 
hippocampal slices according to conventional techniques. Briefly, brains were 
removed and transverse sections containing hippocampi were obtained within 1 min. 
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after mice were decapitated under halothane anesthesia. Slices were kept at room 
temperature in oxygenated artificial cerebrospinal fluid for one hour prior to 
recording. One slice at a time was transferred to the recording chamber, where they 
were maintained at 32 *'C in an interface between oxygenated artificial cerebrospinal 
5 fluid and humidified air. Shces were then allowed to equilibrate in the recording 
chamber for another hour. 

Extracellular field recordings were carried out in the CAl subfield of the 
hippocampus at the Schaeffer collateral-pyramidal cell synapse. Synaptic responses 
were induced by the stimulation of Schaeffer collaterals at a firequency of 0.03 Hz and 

10 an intensity of 30-50 % of maximal response. Tetani to evoke long-term potentiation 
consisted of 5 trains of 100 Hz stimulation lasting for 200 ms at an intertrain interval 
of 10 seconds. Field potentials were recorded using an Axopatch 200B amplifier 
(Axon Instrument). Glass pipettes were fabricated fi-om borosilicate glass with an 
outer diameter of 1.5 mm, and pulled with a two step Narishige puller. Data were 

15 acquired on a 486-IBM compatible computer using PCLAMP6 software (Axon 
Instrument). 

To test for any abnormality in presynaptic function, we investigated the 
differences in paired-pulse faciUtation, which is an example of use-dependent increase 
in synaptic efficacy and is considered to be presynaptic in origin. In hippocampus, 

20 when two stimuli are delivered to the Schaeffer collaterals in rapid succession, paired- 
pulse facilitation manifests itself as an enhanced dendritic response to the second 
stimulus as the iiiterstimulus interval gets shorter. In three pairs of wild- 
type/transgenic mice, we did not observe any difference in the paired-pulse facilitation 
over an interstimulus interval range of 20 ms to 1 sec. These data suggest that in 

25 PS1mi4«v mutant transgenic mice, the excitability of Schaeffer collateral fibers and 
neurotransmitter release are likely to be normal. 

Tetanic stimulation induced a long-lasting increase in the synaptic strength 
in both control (n = 3) and PS1mi4^v mutant transgenic mice (n = 2). In slices obtained 
from the PS1mi46v mutant transgenic mice, long-lasting increase in the synaptic 

30 strength was 30.% more than that obtained from control mice. 
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Although preferred embodiments of the invention have been described 
herein in detail, it will be understood by those skilled in the art that variations may be 
made thereto without departing from the spirit of the invention or the scope of the 
5 appended claims. 
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SEQUBNCC LISTING 



(II GENERAL INTORMATION: 



ti) APPLICANT: ST. GEORGE-H YSLOP , PETER H. 

ROKMENS, JOHANNA M. 
FRASER, PAUL £. 

(ii) TITLE OF INVENTION: NUCLEIC ACIDS AND PROTEINS RELATED TO 
ALZHEIMER'S DISEASE, AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 12 

(XV) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sim & McBurney 

(B) STREET: 330 University Avenue, 6^" Floor 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: MSG 1R7 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS- DOS 

(Dl SOFTWARE: Patentin Release #1,0, Version 11.30 

ivil CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 27-JAN-1997 

(C) CLASSIFICATION; 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/592,541 

(B) FILING DATE: 26-JAN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,673 

(B) FILING DATE: 05-JUL-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,700 

(B) FILING DATE: 12-JUL-1996 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/029,895 

(B) FILING DATE: 08-NOV-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DOCKET! CAN-006PR(d) 

(B) FILING DATE: 02-JAK-1997 



(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: RAE, Patricia A. 

lix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 595-1155 

(B) TELEFAX: (416) 595-1163 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : single 

(D) TOPOLOGY: linear 



( ix ).- FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 145. . 1275 

(D) OTHER INFORMATION: /product- "S5a- 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

AATTCCCAAA TGACCTTTTA TTTCATACAG AGATACAAAG GCAACTATGT GCAGCAACAA 60 

TCTGATGGGC AGTCCAAACT CTTGGGAGGA AGTAAATTCA TGGTAAATGT CATGATGGCG 120 

GTCGGGAGGG AGGAAGGTGG CAAG ATG GTG TTG GAA AGC ACT ATG GTG TGT 171 

Met val Leu Glu Ser Thr Met Val Cys 
1 5 



GTG GAC AAC AGT GAG TAT ATG CGG AAT GGA GAC TTC TTA CCC ACC AGG 219 

7al - - 

10 



Val Asp Asn Ser Glu T^r Met Arg Asn Gly Asg Phe Leu Pro Thr Ar^ 



CTG CAG GCC CAG CAG GAT GCT GTC AAC ATA GTT TGT CAT TCA AAG ACC 267 
Leu Gin Ala Gin Gin Asp Ala Val Asn lie Val Cys His Ser Lys Thr 
30 35 40 
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CGC AGC AAC CCT GAG AAC AAC GTG GGC CTT ATC ACA CTG GCT AAT GAC 
Arg Ser Asn Pro Glu Asn Asn Val Gly Leu lie Thr Leu Ala Asn Asd 
45 50 55 

TGT GAA GTG CTG ACC ACA CTC ACC CCA GAC ACT GGC CGT ATC CTG TCC 
Cys Glu Val Leu Thr Thr Leu Thr Pro Asp Thr Gly Arq lie Leu Ser 
60 65 70 

AAG CTA CAT ACT GTC CAA CCC AAG GGC AAG ATC ACC TTC TGC ACG GGC 
Lys Leu His Thr Val Gin Pro Lys Giy Lys lie Thr Phe Cys Thr Gly 
'5 8 0 8 5 

ATC CGC GTG GCC CAT CTG GCT CTG AAG CAC CGA CAA GGC AAG AAT CAC 
lie Arg Val Ala His Leu Ala Leu Lys His Ara Gin ciy Lyl AsA His 
^" 95 100 105 

AAG ATG CGC ATC ATT GCC TTT GTG GGA AGC CCA GTG GAG GAC AAT GAG 
Lys Met Arg He lie Ala Phe Val Gly Ser Pro Val Glu Asp Asn Glu 
110 115 120 

AAG GAT CTG GTG AAA CTG GCT AAA CGC CTC AAG AAG GAG AAA GTA AAT 
Lys Asp Leu Val Lys Leu Ala Lys Arg Leu Lys Lys Glu lys Val As3[ 
12b 130 

GTT GAC ATT ATC AAT TTT GGG GAA GAG GAG GTG AAC ACA GAA AAG CTG fim 
val Asp lie He Asn Phe Gly Glu Glu Glu Val Asn Thr Gl2 Zyt lIu " 

ACA GCC TTT GTA AAC ACG TTG AAT GGC AAA GAT GGA ACC GGT TCT CAT fi «i 1 

Thr Ala Phe Val Asn Thr Leu Asn Gly Lys Asp Gly Thr Gly ler His 

■^33 IdQ let.' 



363 



4H 



459 



507 



555 



843 



891 



?J9 ilu^ H^? ^^'^ '^TG GCT GAT GCT CTC ATC AGT 699 

Leu val Thr Val Pro Pro Gly Pro Ser Leu Ala Asp Ala Leu lie Ser 

TCT CCG ATT TTG GCT GGT GAA GGT GGT GCC ATG CTG GGT CTT GGT GCC 747 
Ser Pro He Leu Ala Gly Glu Gly Gly Ala Met Leu Gly Leu Gly Ala 
190 195 200 

AGT GAC TTT GAA TTT GGA GTA GAT CCC AGT GCT GAT CCT GAG CTG GCC 795 
Ser Asp Phe Glu Phe Gly Val Asp Pro Ser Ala Asp Pro Glu Leu Ala 
205 210 215 

TTG GCC CTT CGT GTA TCT ATG GAA GAG CAG CGG CAG CGG CAG GAG GAG 
Leu Ala Leu Arg Val Ser Met Glu Glu Gin Arg Gin irg Gin GlS GlS 

225 230 

r^f! f?'^ n*^^ 5?* ^9'^ ^^'^ ^^CT GCT GAG GCC GGG ATT GCT 

Glu Ala Arg Arg Ala Ala Ala Ala Ser Ala Ala Glu Ala Gly lie Ala 
^JD 240 245 

ACG ACT GGG ACT GAA GAC TCA GAC GAT GCC CTG CTG AAG ATG ACC ATC qiQ 
Thr Thr Gly Thr Glu Asp Ser Asp Asp Ala Leu LeS tyl hU ?hr lie 
255 260 265 

AGC CAG CAA GAG TTT GGC CGC ACT GGG CTT CCT GAC CTA AGC AGT ATG Qfi7 

ser Gin Gin Glu Phe Gly Arg Thr Gly Leu Pro Asp Leu Ser Ser Set 

275 280 

ACT GAG GAA GAG CAG ATT GCT TAT GCC ATG CAG ATn Trr rrr r^r- r-/** 

Thr Glu Glu Glu Gin He Ala Tyr Ala Sit Ixr Sit I" l" lln g" 

^05 290 295 

a?^ 11'^ S?^ TCA GCA GAC ATT GAT GCC AGC TCA GCT 10R3 

Ala Glu Phe Gly Gin Ala Glu Ser Ala Asp He Asp Ala ter Se? All 
•'"^ 305 

hIt a«« tS^ l^^ ^'^^ ^'^^ GAG GAT GAT TAC GAC GTG ATG 1131 

Met Asp Thr Ser Glu Pro Ala Lys Glu Glu Asp Asp Tyr Asp Val Met 

320 325 

CAG GAC CCC GAG TTC CTT CAG AGT GTC CTA GAG AAC CTC CCA GCT rrr i i i a 

Gin Asp Pro Glu Phe Leu Gin Ser Val LeS Slu Asn llu PrS Gly vll '^^^ 
j35 340 345 

GAT CCC AAC AAT GAA GCC ATT CGA AAT GCT ATG GGC TCC CTG GCC TCC 1097 
Asp Pro Asn Asn Glu Ala He Arg Asn Ala Met Gly Ser lIu Ala slz ^ 
■*^0 355 360 

r?n af^ in^ f^'^^ ^AC AAG AAG GAG GAA GAC AAG AAG l->7 5 

Gin Ala Thr Lgs Asp Gly Lys Lys Asg Lys Lys Glu Glu As| Lys lys 

TGAGACTGGA GGGAAAGGGT AGCTGAGTCT GCTTAGGGAC TGCATGGGGG AATTC 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS* 

, (A> LENGTH: 377 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



1330 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met VaX Leu Glu Ser Thr Met Val Cys Val Asp Asn Ser Glu Tyr Met 
15 10 15 

Arg Asn Gly Asp Phe Leu Pro Thr Arg Leu Gin Ala Gin Gin Asp Ala 
20 25 30 

Val Asn He Val Cys His Ser Lys Thr Arg Ser Asn Pro Glu Asn Asn 
35 40 45 

Val Gly Leu He Thr Leu Ala Asn Asp Cys Glu Val Leu Thr Thr Leu 
50 55 60 

Thr Pro Asp Thr Gly Arg He Leu Ser Lys Leu His Thr Val Gin Pro 
65 70 75 80 

Lys Gly Lys He Thr Phe Cys Thr Gly He Arg Val Ala His Leu Ala 
85 . 90 95 

Leu Lys His Arg Gin Gly Lys Asn His Lys Met Arg He He Ala Phe 
100 105 110 

Val Gly Ser Pro Val Glu Asp Asn Glu Lys Asp Leu Val Lys Leu Ala 
115 120 125 

Lys Arg Leu Lys Lys Glu Lys Val Asn Val Asp He He Asn Phe Gly 
130 135 140 

Glu Glu Glu Val Asn Thr Glu Lys Leu Thr Ala Phe Val Asn Thr Leu 
145 150 155 160 

Asn Gly Lys Asp Gly Thr Gly Ser His Leu val Thr Val Pro Pro Gly 
165 170 175 

Pro Ser Leu Ala Asp Ala Leu He Ser Ser Pro He Leu Ala Gly Glu 
180 185 190 

Gly Gly Ala Met Leu Gly Leu Gly Ala Ser Asp Phe Glu Phe Gly Val 
195 200 205 

Asp Pro Ser Ala Asp Pro Glu Leu Ala Leu Ala Leu Arg Val Ser Met 
210 215 220 

Glu Glu Gin Arg Gin Arg Gin Glu Glu Glu Ala Arg Arg Ala Ala Ala 
225 230 235 240 

Ala Ser Ala Ala Glu Ala Gly He Ala Thr Thr Gly Thr Glu Asp Ser 
245 250 255 

Asp Asp Ala Leu Leu Lys Met Thr He Ser Gin Gin Glu Phe Gly Arg 
260 265 270 

Thr Gly Leu Pro Asp Leu Ser Ser Met Thr Glu Glu Glu Gin He Ala 
275 280 285 

Tyr Ala Met Gin Met Ser Leu Gin Gly Ala Glu Phe Gly Gin Ala Glu 
290 295 300 

Ser Ala Asp He Asp Ala Ser Ser Ala Met Asp Thr Ser Glu Pro Ala 
305 310 315 320 

Lys Glu Glu Asp Asp Tyr Asp Val Met Gin Asp Pro Glu Phe Leu Gin 
325 330 335 

Ser Val Leu Glu Asn Leu Pro Gly Val Asp Pro Asn Asn Glu Ala He 
340 345 350 

Arg Asn Ala Met Gly Ser Leu Ala Ser Gin Ala Thr Lys Asp Gly Lys 
355 360 365 

Lys Asp Lys Lys Glu Glu Asp Lys Lys 
370 375 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3841 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .3121 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION : 1 . . 3H4 1 

(D) OTHER INFORMATION: /note** *'GT24' 



SUBSTITUTE SHEET {RULE 26) 



wo 97/27296 



PCT/CA97/00051 



(xil SEQUENCE DESCRIPTION: 



-99- 

SEO ID N0:3: 



T TCA CAG CTC CCG GCC CGA GGC ACA CAA GCC CGA GST ACG GGC CAG 
Ser GXn Leu Pro Ala Arg Gly Thr Gin Ala Arg Xaa ?hr Gly Gin 
15 10 15 

5*^^ CGC GCC GGC CAC CTG GCG GGG CCC 

Ser Phe Ser Gin Glv Thr Thr Ser Arg Ala Gly His Leu Ala Gly Pro 
20 25 3& 

r^M n?^ E^*^ CCR CCG CGG GAG CCG TTC GCG CCC 

Glu Pro Ala Pro Pro Pro Pro Pro Xaa Pro Arg Glu Pro Phe Ala Pro 

40 45 

m rl^ '^'^^ CCC GAC GCG CCG CCC GCC GCC GCC 

Ser Leu Glv Ser Ala Phe His Leu Pro Asp Ala Pro Pro Ala Aia Alf 

55 60 

A?^ S?^ 5?? l^^ l^^ l^^ "^CC ACG CTG CCC GCG CCG CCG CGC 

Ala Ala Ala Leu Tyr Tyr Ser Xaa Ser Thr Leu Pro Ala Pro Pro Irg 



GGG 
Gly 



CAG 
Gin 



GGC 
Gly 



ACC 
Thr 



ATC 
lie 



CCC 
Pro 
160 

CCC 
Pro 



GGC 
Gly 



CGC 
Arg 



TCC 
Ser 



AGC 
Ser 



CGC 
Arg 
145 



TCC 
Ser 



GGC 
Gly 



TCG 
Ser 



TCG 
Ser 
130 

GTG 
Val 



CCG CTG GCC 
Pro Leu Ala 
85 

GGC TCG GCC 
Gly Ser Ala 
100 

CCC AAG CAG 
Pro Lys Gin 
115 

CCC ATC AAC 
Pro lie Asn 



ACC TCG CCC 
Thr Ser Pro 



i*^^ S?^ ^^'^ '^^^ CCC ACC AAG CTG 

Ala Pro Gin Gly Gly Ser Pro Thr Lys Leu 

96 ' 95 

CCC GAG GGC GCC ACC TAC GCC GCG CCG CGC 

Pro Glu Gly Ala Thr Tyr Ala Ala Pro Arg 

105 no ^ 

TCG CCC AGC CGC CTG GCC AAG TCC ^TAC AGC 

Ser Pro Ser Arg Leu Ala Lys Ser Tyr Ser 
120 125 

tT^ ?J9 ?J9 '^^^ ''CG GCC GGC CTG TCC CCG 

Ser Ala Gly Leu Ser Pro 
135 14& 

CCC ACC GTG CAG TCC ACC ATC TCC TCC TCG 

Pro Thr Val Gin Ser Thr He Ser Ser Ser 
150 155 



ATC CAC 
He His 



ACC AAG 
Thr Lys 



CAG 
Gin 



GCT 
Ala 



GAG CTG 
Glu Leu 



GGT TCC 
Gly Ser 
210 



CAG CTG AGC TCC ACC 
Gin Leu Ser Ser Thr 
165 

CGC CTG GTC CAC GCG 
Arg Leu Val His Ala 
IBO 

TAT GCC ACG GCC ACC 
Tyr Ala Thr Ala Thr 



Tyr 
195 



CGA GCC TCA TAC AGC 
Arg Ala Ser Tyr Ser 
215 



ATC GGC ACG TAC GCC ACC CTG TCG 

He Gly Thr Tyr Aia Thr Leu Ser 

no 175 

TCC GAG CAG TAC AGC AAG CAC TCG 

Ser Glu Gin Tyr Ser Lys His Ser 
185 190 

CTC CAG AGG CCG GGC AGC CTG GCA 

Leu Gin Arg Pro Gly Ser Leu Ala 
200 205 

AGC CAG CAT GGG CAC CTG GGC CCA 

Ser Gin His Gly His Leu Gly Pro 
220 



GAG TTG CGG GCC CTG CAG TCC CCA GAA CAC CAC ATA GAT CCC ATC TAT 
Glu Leu Arg Ala Leu Gin Ser Pro Glu His His lie Asp Pro lie lyl 



GAA GAC CGC 
Glu Asp Arg 
240 

CAG GGG GAC 
Gin Gly Asp 



ACA GCC CCA 
Thr Ala Pro 



GGC AGC CAG 
Gly Ser Gin 
290 

GCC AGC TAT 
Ala Ser Tyr 
305 



GTC TAT CAG 
Val Tyr Gin 
245 

CCT CTG CCG 
Pro Leu Pro 
260 

TCT TCC CCT 
Ser Ser Pro 
275 

CAC GGC CCA 
His Gly Pro 



GCC GCC GGC 
Aia Ala Gly 



AAG CCC CCT ATG ACG AGT CTC AGC CAG AGC 

Lys Pro Pro Met Arg ser Leu Ser Gin Ser 

250 255 

Irn S^^ SS^ S?^ '^^^ CGC ACG AGC 

Pro Ala His Thr Gly Thr Tyr Arg Thr Ser 

265 270 

r?J S^f *^^C CCC TTG CAG CGC ACA 
28§ ^^"^ 285 

rt« 5?^ "^CG GCC ACC TTC CAG AGG 

Gin Asn Ala Aia Ala Aia Thr Phe Gin Arq 

295 300 y 

S^o "^^C GCG GAC CCC TAC CGA 

Pro Ala Ser Asn Tyr Ala Asp Pro Tyr Arg 



CAG CTG 
Gin Leu 
320 

CCT GCT 
Pro Ala 



AGC ATT 
Ser lie 



CCG GAA 



CAG TAT TGT CCC TCT GTT 
Gin Tyr Cys Pro Ser Val 
325 

CTC CCG CCT GAA GGC ACC 
Leu Pro Pro Glu Gly Thr 
340 

CAG AAA GAT CCC AGA GAA 
3^1 



GAG TCT CCA TAC AGC AAA TCC GGC 

Glu Ser Pro Tyr Ser Lys Ser Gly 
330 

TTG GCC AGG TCC CCG TCC ATT GAT 

Leu Ala Arg Ser Pro Ser He Asd 
345 350 

TTT GGA TGG AGA GAC CCG GAA CTG 
Phe Gly Trp Arg Asp Pro Glu Leu 
360 365 



GTG ATT CAG ATG TTG CAG CAC CAG TTT CCC TCG GTC CAG TCT 



46 



94 



142 



190 



238 



286 



334 



382 



430 



478 



526 



574 



622 



670 



718 



766 



814 



862 



910 



958 



1006 



1054 



1 102 



H50 
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Pro Glu val lie Gin Met Leu Gin His Gin Phe Pro Ser Val Gin Ser 
370 375 380 

AAC GCG GCA GCC TAC TTG CAA CAC CTC TGT TTT GGA GAC AAC AAA ATT 

Asn Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn Lys lie 

385 390 395 

AAA GCC GAG ATA AGG AGA CAA GGA GGC ATC CAG CTC CTG GIG GAC CTG 

Lys Ala Glu He Arg Arq Gin Gly Gly He Gin Leu Leu Val Asp Leu 

400 405 410 415 

TTG GAT CAT CGG ATG ACC GAA GTC CAC CGT AGT GCC TGT GGA GCT CTG 

Leu Asp His Arg Met Thr Glu Val His Arg Ser Ala Cys Gly Ala Leu 
420 425 430 

AGA AAC CTG GTG TAT GGG AAG GCC AAC GAT GAT AAC AAA ATT GCC CTG 

Arg Asn Leu Val Tyr Gly Lys Ala Asn Asp Asp Asn Lys He Ala Leu 
435 440 445 

AAA AAC TGT GGT GGC ATC CCA GCA CTG GTG AGG TTA CTC CGC AAG ACG 

Lys Asn Cys Gly Gly He Pro Ala Leu Val Arq Leu Leu Arg Lys Thr 
450 455 460 

ACT GAC CTG GAG ATC CGG GAG CTG GTC ACA GGA GTC CTT TGG AAC CTC 

Thr Asp Leu Glu He Arg Glu Leu Val Thr Gly Val Leu Trp Asn Leu 

465 470 475 



TCC TCA TGC GAT GCA CTC 

Ser Ser Cys Asp Ala Leu 

480 485 

GTA CTG ACC AAC GCG GTG 

Val Leu Thr Asn Ala Val 
500 

CCT CTT CAG GAT GAT CGG 

Pro Leu Gin Asg Asp Arg 

CGT AAC GCC ACC GGG TGC 

Arg Asn Ala Thr Gly Cys 
530 



AAA ATG CCA ATC ATC CAG GAT GCC CTA GCA 
Lys Met Pro He He Gin Asp Ala Leu Ala 
490 - 495 

ATT ATC CCC CAC TCA GGC TGG GAA AAT TCG 
He He Pro His Ser Gly Trp Glu Asn Ser 
505 510 

AAA ATA CAG CTG CAT TCA TCA CAG GTG CTG 
Lys He Gin Leu His Ser Ser Gin Val Leu 
520 525 

CTA AGG AAT GTT AGT TCG GCC GGA GAG GAG 
Leu Arg Asn Val Ser Ser Ala Gly Glu Glu 
535 540 



GCC CGC AGA AGG ATG AGA GAG TGT GAT GGG CTT ACG GAT GCC TTG CTG 

Ala Arq Arg Arg Met Arg Glu Cys Asp Gly Leu Thr Asp Ala Leu Leu 

545 550 555 

TAC GTG ATC CAG TCT GCG CTG GGG AGC AGT GAG ATC GAT AGC AAG ACC 

Tyr Val He Gin Ser Ala Leu Gly Ser Ser Glu He Asp Ser Lys Thr 



565 



570 



575 



GTT GAA AAC TGT GTG TGC ATT TTA AGG AAC CTC TCG TAC CGG CTG GCG 

Val Glu Asn Cys Val Cys He Leu Arg Asn Leu Ser Tyr Arg Leu Ala 
580 585 590 

GCA GAA ACG TCT CAG GGA CAG CAC ATG GGC ACG GAC GAG CTG GAC GGG 

Ala Glu Thr Ser Gin Gly Gin His Met Gly Thr Asp Glu Leu Asp Glv 
595 600 605 

CTA CTC TGT GGC GAG GCC AAT GGC AAG GAT GCT GAG AGC TCT GGG TGC 

Leu Leu Cys Gly Glu Ala Asn Gly Lys Asp Ala Glu Ser Ser Gly Cys 

610 615 620 

TGG GGC AAG AAG AAG AAG AAA AAG AAA TCC CAA GAT CAG TGG GAT GGA 

Trp Gl^ Lys Lys Lys Lys L^s Lys Lys Ser Gin As| Gin Trp Asp Gly 

GTA GGA CCT CTT CCA GAC TGT GCT 

Val Gly Pro Leu Pro Asp Cys Ala 

640 645 

CTG TGG CAC CCA TCA ATA GTC AAA 

Leu Trp His Pro Ser He Val Lys 
660 

TGC TCA AAT CCA GAC ACG CTG GAA 

Cys Ser Asn Pro Asp Thr Leu Glu 
675 

TTG GCT GCA GGG AGC TGG AAG TGG 

Leu Ala Ala Gly Ser Trp Lys Trp 

690 695. 

CGA AAA GAG AAA GGC CTG CCC ATC 

Arg Lys Glu Lys Gly Leu Pro He 

705 710 

AAT GAC CGT GTG GTG TGC GCG GTG 

Asn Asp Arg Val Val Cys Ala Val 

720 755 

TTG GAC GTC AGA AAT AAG GAG CTC 

Leu Asp Val Arg Asn Lys Glu Leu 
740 



GAA CCA CCA AAA GGG ATC CAG ATG 

Glu Pro Pro Lys Gly He Gin Met 

650 655 

CCC TAC CTC ACA CTG CTC TCT GAG 

Pro Tyr Leu Thr Leu Leu Ser Glu 
665 .670 

GGG GCG GCA GGC GCC CTG CAG AAC 

Gly Ala Ala Gly Ala Leu Gin Asn 
680 685 

TCA GTA TAT ATC CGA GCC GCT GTC 

Ser Val Tyr He Arg Ala Ala Val 
700 

CTC GTG GAG CTG CTC CGA ATA GAC 

Leu Val Glu Leu Leu Arg He Asp 
715 

GCC ACT GCG CTG CGG AAC ATG GCC 

Ala Thr Ala Leu Arg Asn Met Ala 

730 735 

ATC GGC AAA TAC GCC ATG CGA GAC 

He Gly Lys Tyr Ala Met Arg Asp 
15 750 



1196 



1246 



1294 



1342 



1390 



1438 



1466 



1534 



1582 



1630 



1678 



1726 



1774 



1822 



1870 



1918 



1966 



2014 



2062 



2110 



2158 



2206 



2 254 



74i 
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AGC AAC AAC 

Ser Asn Asn 

765 

fSc f?^ 5^^ I^^ '^CA GTG ACA GCT GTC TGC TGC ACA Crr rnr 

Lys Ala Met Ser Asp Asp Thr Val Thr Ala Val l^s 6vs ?hJ LeS Hif 



-101- 

£rs v.f in j|| £n f^; i?; jjs ;« .« «„ .„ 

'-'J 760 Tfii; 



2350 



voA ini: rtia vai Cys 

tit fi! Ji.' ?£s tr. s.;f sj; s;; in j;s s;; i:; j;; 

ff| 155 IK n; si; m nt is Jj; «s; j;; jj. a... 

«5 in ??s j?j s:f j;s j;; n; c.! ;„ cje j;c 

m in tv^ tn tit if; SST EIS j;; - 1;; 

??; sj; 5j: sif is? i|| n; ?s? jis s?s |?| sis jfs it; "»« 
}fl Sfs K? is; i;; s;^ ;;s m p; ;;: ?:;.sfs ?if 
ni j;; sjs j;s n; ;;• ;;; tv. ;;: ir, tit s:? 

890 

Hi s;; e:s k; j;i k; s;; k; sjs jj; ;s; s;; s« j;s 
j;; ;s; u; s?| s;; s; j tit i;; |}j st; ;ji m ssi |.. 
sj; ?s; |;i i?; ;;s i;; ;n fs; iis ;j; ;=« 

s;; ;;; s;: ttt ttt tn ttt st; m sii i;j s;: s;; 
Sfs SfS IK ;;s s;; };; j;; s;; ;« sj= «s ;« „. p. 
§ts j:; n; ?Sf !|| s;; s}; j;[ ;;c ,.c c.. 



2734 



2782 



2830 



2878 



2926 



2974 



970 - 

TTC TTC GAG GAC CAG 
985 '^^^ 

a; ;;; ;;: ;;; tii ir. s;s ;;;;£; £i; -5 j.; 3... 



3070 



tXi^*"^ ^eu Gly 

1000 1005 

ll'r 511 JS^^?- .c. c„ ccc ccr^ccc ..c .c. 

£I|s^- fsL"- fsL^^V?^^ I^^ 3.3 

GTG TGAGGAGCAG GGCACAGGCG CTCCGGGAAA CAGTGCATGT GCATCCATAC 3,7, 
1040 '■ ^ 

CACAAGACAT TTCTTTCTGT TTTCGTTTTT TTCTCCTGCA AATTTAGTTT GTTAAAGCCT 
GTTCCATAGG AAGGCTGTGA TAACCAGTAA GGGAAATATT AAGAGCTATT TTAGAAAGCT 
AAATGAATCG CAAGTTAACT TGGAAATCAG lAGAAAGCTA AAGTGATCCT AAATATGACA 
GTGGGCAGCA CCTTICCTAG CGTGTTNTGT TAGGAGTAAC GAGAAGTGCI TTATACTGAA 
CGTGGGTTGN TTGGTAGGGT GGAGNCGAGG CATTCGGGCC GGTGGGGCGT AAGGGTTATC 
GTTAAGCACA AGACACAGAA TAGTTTACAC ACTGTGTGGG GGACGGCTTC TCACGCTTTG 
TTTACTCTCT TCATCCGTTG TGACTCTAGG CTTCAGGTIG CATrGGGGTT CCTCTGIACA 
GCAAGATGTT TCTTGCCTTT TGTTAATGCA TTGTTGTAAA GTATTTGATG TACATTACAG 
ATTAAAGAAG NAAAGCGCGT TGTGTATATT ACACCAATNC CGCCGTGTTT CCTCATCTAT 
GGTTCTAAAT ATTGCTTCAA TTTrwaarTT 

^^liCAA .TTCNAACTT TTGAAAGATG TATGGATTTC CAGTTTTTCT 

SUBSTrrUTE SHEET (RULE 26) 



3231 
3291 
3351 
3411 
34 71 
3531 
3591 
3651 
3711 
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TTACTTTCTC CCAGTATGTT TTAACCNMMN AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 3831 
AAAACTCGAC 3841 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 arolno acids 
IB) TYPE: amino acid 
(Dl TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ser Gin Leu Pro Ala Arg Gly Thr Gin Ala Arq Xaa Thr Gly Gin Ser 
15 10 15 

Phe Ser Gin Gly Thr Thr Ser Arg Ala Gly His Leu Ala Gly Pro Glu 
20 25 30 

Pro Ala Pro Pro Pro Pro Pro Xaa Pro Arg Glu Pro Phe Ala Pro Ser 
35 40 45 

Leu Gly Ser Ala Phe His Leu Pro Asp Ala Pro Pro Ala Ala Ala Ala 
50 55 60 

Ala Ala Leu Tyr Tyr Ser Xaa Ser Thr Leu Pro Ala Pro Pro Arg Gly 
65 70 75 80 

Gly Ser Pro Leu Ala Ala Pro Gin Gly Gly Ser Pro Thr Lys Leu Gin 
85 90 95 

Arg Gly Gly Ser Ala Pro Glu Gly Ala Thr Tyr Ala Ala Pro Arg Gly 
100 105 110 

Ser Ser Pro Lys Gin Ser Pro Ser Arg Leu Ala Lys Ser Tyr Ser Thr 
115 120 125 

Ser Ser Pro lie Asn He Val Val Ser Ser Ala Gly Leu Ser Pro He 
130 135 140 

Arg Val Thr Ser Pro Pro Thr Val Gin Ser Thr He Ser Ser Ser Pro 



14? 150 155 160 

He His Gin Leu Ser Ser Thr He Gly Thr Tyr Ala Thr Leu Ser Pro 
165 170 175 

Thr Lys Arg Leu Val His Ala Ser Glu Gin Tyr Ser Lys His Ser Gin 
180 185 190 

Glu Leu Tyr Ala Thr Ala Thr Leu Gin Arg Pro Gly Ser Leu Ala Ala 
195 200 205 

Gly Ser Arg Ala Ser Tyr Ser Ser Gin His Gly His Leu Gly Pro Glu 
210 215 220 

Leu Arg Ala Leu Gin Ser Pro Glu His His He Asp Pro He Tyr Glu 
225 230 235 240 

Asp Arg Val Tyr Gin Lys Pro Pro Met Arg Ser Leu Ser Gin Ser Gin 
245 256 255 

Gly Asp Pro Leu Pro Pro Ala His Thr Gly Thr Tyr Arg Thr Ser Thr 
260 265 270 

Ala Pro Ser Ser Pro Gly Val Asp Ser Val Pro Leu Gin Arg Thr Gly 
275 280 285 

Ser Gin His Gly Pro Gin Asn Ala Ala Ala Ala Thr Phe Gin Arg Ala 
290 295 300 

Ser Tyr Ala Ala Gly Pro Ala Ser Asn Tyr Ala Asp Pro Tyr Arg Gin 
305 . 310 315 / ^ 

Leu Gin Tyr Cys Pro Ser Val Glu Ser Pro Tyr Ser Lys Ser Gly Pro 
325 330 335 

Ala Leu Pro Pro Glu Gly Thr Leu Ala Arg Ser Pro Ser He Asp Ser 
340 ,345 350 

He Gin Lys Asp Pro Arg Glu Phe Gly Trp Arg Asp Pro Glu Leu Pro 
355 360 365 

Glu Val He Gin Met Leu Gin His Gin Phe Pro Ser Val Gin Ser Asn 
370 375 380 

Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn Lys He Lys 
385 390 395 400 

Ala Glu He Arg Arq Gin Gly Gly He Gin Leu Leu Val Asp Leu Leu 
405 410 415 
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Asp His Arg M|t Thr Glu Val His Ar| Ser Ala Cys Gly Ala Leu Arg 

Asn Leu Val 7yr Gly Lys Ala Asn Asp Asp Asn Lys lie Ala Leu Lys 

Asn C^s Gly Gly He Pro Ala Leu Val Arg Leu Leu Arg Lys Thr Thr 

As| Leu Glu He Arg Glu' Leu Val Thr Gly Val Leu Trp Asn LeU Ser 
' " 475 (go 

ser cys Asp Ala Leu Lys Met Pro He lie Gin Asp Ala Leu Ala Val 

490 

Leu Thr Asn Ala Val He He Pro His Ser Gly Trp Glu Asn Ser Pro 

505 

Leu Gin Asp Asp Arg Lys He Gin Leu His Ser Ser Gin Val Leu Arq 

520 525 ^ 

Asn Ala Thr Gly Cys Leu Arq Asn Val Ser Ser Ala Gly Glu Glu Ala 

ajD 540 

Arq Arg Arg Met Arg Glu Cys Asp Gly Leu Thr Asp Ala Leu Leu Tyr 

555 5^0 
Val He Gin Ser Ala Leu Gly Ser Ser Glu He Asp Ser Lys Thr Val 



575 



Glu Asn Cys Val Cys He Leu Arg Asn Leu Ser Tyr Arg Leu Ala Ala 
'"U 585 590 

Glu Thr III Gin Gly Gin His Met Gly Thr Asp Glu Leu Asp Gly Leu 

Leu C|,s Gly Glu Ala Asn Gig Lys Asp Ala Glu |er Ser Gly Cys Trp 

Gig Lys Lys Lys Lys Lys Lys Lys Ser Gin Asp Gin Trp Asp Gly Val 

635 

Gly Pro Leu Pro Asp Cys Ala Glu Pro Pro Lys Gly He Gin Met Leu 

650 655 

Trp His Pro ser He Val Lys Pro Tvr Leu Thr Leu Leu Ser Glu Cys 

^ 665 

ser Asn Pro Asp Thr Leu Glu Glv Ala Ala Gly Ala Leu Gin Asn Leu 

"'^ 680 685 

Ala Ala Gly Ser Trp Lys Tr| Ser Val Tyr He Arg Ala Ala Val Arg 

7^5 110 "'^ *»P 

715 720 

Asp Arg val Val Cvs Ala Val Ala Thr Ala Leu Arg Asn Met Ala Leu 

730 735 

ASP Val Arg Asn Lys Glu Leu He Gig Lys Tyr Ala Met Arg Asp Leu 
Val His Arg Leu Pro Gly Gly Asn Asn Ser Asn Asn Thr Ala Ser Lys 
Ala Met Set Asp Asp Thr Val Thr Ala Val Cys C^s Thr Leu His Glu 

val He Thr Lys Asn Met Glu Asn Ala Lys Ala Leu Arg Asp Ala Gly 

795 Qoo 
Gly He Glu Lys Leu Val Gly He Ser L^s Ser Lys Gly Asp L^s His 

ser Pro Lys Val Val Lys Ala Ala |er Gin Val Leu Asn Ser Met Trp 

Gin Tyr Arg Asp Leu Arg s,r leu Tyr Lys Lys Asp Gig Trp Ser Gin 

Tyr His Phe Val Ala Ser Ser Ser Thr He Glu Arg lip Arg Gin Arg 

Pro Tyr Ser Ser Ser Arg Thr Pro Ser He Ser Pro Val Arg Val Ser 

0 '5 990 

pro Asn Asn Arg Ser Ala Ser Ala Pro Ala Ser Pro Arg Glu Met He 

8 SO 6 9 5 

ser Leu Lys Glu Arg Lys Thr Asp Tyr Glu Cys Thr Gly Ser Asn Ala 

3Ub 910 

Thr Tyr His Gly Gly Lys Gly Glu His Thr Ser Arg L^s Asp Ala Met 
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Thr Ala Gin Asn Thr Gly He Ser Thr Leu Tyr Arg Asn Ser Tvr Glv 
930 935 940 

Ala Pro Ala Glu Asp lie Lys His Asn Gin Val Ser Ala Gin Pro Val 
945 950 955 ggQ 

Pro Gin Glu Pro Ser Arg Lys Asp Tyr Glu Thr Tyr Gin Pro Phe Gin 
965 970 975 

Asn Ser Thr Arg Asn Tyr Asp Glu ser Phe Phe Glu Asp Gin Val His 
980 985 990 

His Arg Pro Pro Ala Ser Glu Tyr Thr Met His Leu Gly Leu Lys Ser 
995 1000 1005 

Thr Gly Asn Tyr Val Asp Phe Tyr Ser Ala Ala Arg Pro Tyr Ser Glu 
1010 1015 1020 

Leu Asn Tyr Glu Thr Ser His Tyr Pro Ala Ser Pro Asp Ser Trp Val 
1025 1030 1035 ^ 1040 

(2) INFORMATIOK FOR SEO ID NO: 5: 

(i) SEOaENCE CHARACTERISTICS: 

(A) LENGTH: 3907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 142 . . 3777 

(D) OTHER INFORMATION: /note= "p0071" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CTACTGTTGT TTTTGAGGGG CGGGCAGCCG CGCCGCCGCG GCACTTTTTT AATTTTTTCG 60 

GGTGCCGCAG CAGCGACCCC TCGGCGCCGA TGTCCCTGAT CCCTGGAGCG ACGACGGCCG 120 

CTGCCTAAGC TGGGAAGAGG A ATG CCA GCT CCT GAG CAG GCC TCA TTG GTG 17 1 

Met Pro Ala Pro Glu Gin Ala Ser Leu Val 
1 5 10 

GAG GAG GGG CAA CCA CAG ACC CGC CAG GAA GCT GCC TCC ACT GGC CCA 219 
Glu Glu Gly Gin Pro Gin Thr Arg Gin Glu Ala Ala Ser Thr Glv Pro 
15 20 25 

GGC ATG GAA CCC GAG ACC ACA GCC ACC ACT ATT CTA GCA TCC GTG AAG 267 
Gly Met Glu Pro Glu Thr Thr Ala Thr Thr He Leu Ala Ser Val Lys 
30 35 40 



GAG CAG GAG CTT CAG TTT CAG CGA CTC ACC CGA GAA CTG GAA GTG GAA 315 
Glu Gin Glu Leu Gin Phe Gin Arg Leu Thr Arg Glu Leu Glu Val Glu 
45 50 55 



AGG CAG ATT GTT GCC AGT CAG CTA GAA AGA TGT AGG CTT GGA GCA GAA 363 

Arg Gin He Val Ala Ser Gin Leu Glu Arg Cys Arg Leu Gly Ala Glu 

60 65 7 0 

TCA CCA AGC ATC GCC AGC ACC AGC TCA ACT GAG AAG TCA TTT CCT TGG 411 

Ser Pro Ser He Ala Ser Thr Ser Ser Thr Glu Lys Ser Phe Pro Tro 

75 80 85 90 

AGA TCA ACA GAC GTG CCA AAT ACT GGT GTA AGC AAA CCT AGA GTT TCT 459 

Arg Ser Thr Asp Val Pro Asn Thr Gly Val Ser Lys Pro Arg val Ser 
95 100 105 

GAC GCT GTC CAG CCC AAC AAC TAT CTC ATC AGG ACA GAG CCA GAA CAA 507 

Asp Ala Val Gin Pro Asn Asn Tyr Leu He Arg Thr Glu Pro Glu Gin 
110 115 120 

GGA ACC CTC TAT TCA CCA GAA CAG ACA TCT CTC CAT GAA AGT GAG GGA 555 

Gly Thr Leu Tyr Ser Pro Glu Gin Thr Ser Leu His Glu Ser Glu Gly 

125 130 135 

TCA TTG GGT AAC TCA AGA AGT TCA ACA CAA ATG AAT TCT TAT TCC GAC 603 

Ser Leu Gly Asn Ser Arg Ser Ser Thr Gin Met Asn Ser Tyr Ser Asp 

140 145 150 

AGT GGA TAC CAG GAA GCA GGG AGT TTC CAC AAC AGC CAG AAC GTG AGC 651 

Ser Gly Tyr Gin Glu Ala Gly Ser Phe His Asn Ser Gin Asn Val Ser 

155 160 165 170 

AAG GCA GAC AAC AGA CAG CAG CAT TCA TTC ATA GGA TCA ACT AAC AAC 699 

Lys Ala Asp Asn Arq Gin Gin His Ser Phe He Gly Ser Thr Asn Asn 
175 180 185 

CAT GTG GTG AGG AAT TCA AGA GCT GAA GGA CAA ACA CTG GTT CAG CCA 74 7 

Hxs Val Val Arg A-sn Ser Arg Ala Glu Gly Gin Thr Leu Val Gin Pro 
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ISO 195 200 

TCA GTA GCC AAT CGG GCC ATG AGA AGA GTT AGT TCA GTT CCA TCT AGA 795 

Ser Val Ala Asn Arg Ala Met Arq Arg Val Ser Ser Val Pro Ser Arc 

205 210 215 

GCA CAG TCT CCT TCT TAT GTT ATC AGC ACA GGC GTG TCT CCT TCA AGG 843 

Ala Gin ser Pro Ser Tyr Val lie ser Thr Gly Val Ser Pro Ser Arg 

220 225 230 

GGG TCT CTG AGA ACT TCT CTG GGT AGT GGA TTT GGC TCT CCG TCA GTG flQl 

Glv ser Leu Arg Thr Ser Leu Gly Ser Gly Phe Gly Ser Pro Ser Val 

235 240 7A^ -jcA 



220 225 ' 230 

TTT GGC 

Phe Gly 

245 250 

ACC GAC CCC CGA CCT CTG AAC CCC AGT GCA TAT TCC TCC ACC ACA TTA Q5q 
Thr Asp Pro Arg Pro Leu Asn Pro Ser Ala Tyr Ser Ser Thr Thr Leu 
255 260 265 

i^l ^fl ff'^ I^^ TCA CAG AGA CCC GCC TCC CCA 987 

Pro Ala Ala Arg Ala Ala Ser Pro Tyr Ser Gin Arg Pro Ala Ser Pro 
270 275 280 

ff^ yl^ '^^^ ^"^^ '^CC CGG CAG ACC TCC AAT 1035 

Thr Ala lie Arg Arg He Gly Ser Val Thr Ser Arg Gin Thr Ser Asn 

2B5 290 295 

r?* SE^ S^'' ACC GCC AGA GTG GGG TCC 1083 

Pro Asn Gly Pro Thr Pro Gin Tyr Gin Thr Thr Ala Arg Val Gly Ser 

300 305 310 

CCA CTG ACC CTG ACG GAT GCA CAG ACT CGA GTA GCT TCC CCA TCC CAA 1131 
Pro Leu Thr Leu Thr Asp Ala Gin Thr Arg Val Ala Ser Pro Ser Gin 
320 325 -i-jn 



325 330 



r?S r^^n SI? I^^ I^^ AAA CGC TCA GGG ATG ACC GCC GTA 1179 

Gly Gin Val Gly Ser Ser Ser Pro Lys Aro Ser Gly Met Thr Ala Val 

335 340 345 

CCA CAG CAT CTG GGA CCT TCA CTG CAA AGG ACT GTT CAT GAC ATG GAG 1227 

Pro Gin Hxs Leu Gly Pro Ser Leu Gin Arg Thr Val His Asp Met Glu 

350 355 260 

G?S PhP r?v r^n r^^ l^'^ ^T^ '^^'^ ^'^^ ^"^^ GTT CCA CCC 1275 

Gin Phe Gly Gin Gin Gin Tyr Asp He Tyr Glu Arg Met Val Pro Pro 

Jb5 370 375 

AGG CCA GAC AGC CTG ACA GGC TTA CGG AGT TCC TAT GCT AGT CAG CAT 

Arg Pro Asp Ser Leu Thr Gl^ Leu Arg Ser Ser T^r Ala Ser Gin Hil 

AGT CAG CTT GGG CAA GAC CTT CGT TCT GCC GTG TCT CCC GAC TTG CAC 

Ser Gin Leu Gly Gin Asp Leu Arg Ser Ala Val Ser Pro Asp Lei His 

•^^^ 400 405 410 
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1371 



ill tF^ S?^ ACC TAT TAC AGC CCA GTG TAC CGC 1419 

He Thr Pro He Tyr Glu Gly Arg Thr Tvr Tyr Ser Pro Val Tyr Arg 
415 420 455 ^ 



1467 



AGC CCA AAC CAT GGA ACT GTG GAG CTC CAA GGA TCG CAG ACG GCG TTG 

Ser Pro Asn His Gly Thr Val Glu Leu Gin Gly ler Gin ?hr Ala lln 

435 

Ivl Irl Thr afl vll m r?' ^f* "* *5G ACA TCC AGC 1515 

Tyr Arg Thr Gly Val Ser Gly lie Gly Asn Leu Gin Arg Thr Ser Ser 

450 455" 



1707 



CAA CGA AGT ACC CTT ACA TAC CAA AGA AAT AAT TAT GCT CTG AAC ACA Tifil 
Gin Arg Ser Thr Leu Thr Tgr Gin Arg Asn Asn 7^ Ala Leu A?n t" 

ACA GCT ACC TAC GCG GAG CCC TAC AGG CCT ATA CAA TAC cca rrr r-ii» i<:,, 
Thr Ala Thr Tyr Ala Glu Pro Tyr Arg p" lie GlS Tyr a" vll GlS ^"^ 

i?S irs Sts Ill j;- III SI? til t-^i III g« 

ir, m irr jis III in s?^ ijs 

515 520 

TTT GCC TGG CGT GAT CCT GAG TTG CCT GAG GTC ATT CAC ATG CTT rar if^a 
Phe Ala Tro Arg Asp Pro Glu Leu Pro Glu vIi lU Sis Sit III gJS ''^^ 

530 53 5 

III III il^o HI 511 m tl^. III III CTG CAG CAC ie03 

III irs III tti i^si III tr. §1? i?f tilt III SI? ir. sf? ^ 

GGA ATC AAG CAT CTG GTT GAC CTT CTG GAC CAC AGA GTT TTG GAA GTT 



1899 
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Gly lie Lys His Leu Val Asp Leu Leu Asp His Arg Val Leu Glu Val 
5/5 580 585 

CAG AAG AAT GCT TGT CGT GCC CTT CGA AAC CTC GTT TTT GGC AAG TCT 1947 
Gin Lys Asn Ala Cys Gly Ala Leu Arg Asn Leu Val Phe Gly Lys Ser 
590 595 600 

ACA GAT GAA AAT AAA ATA GCA ATG AAG AAT GTT GGT G6G ATA CCT GCC 1995 
Thr Asp Glu Asn Lys He Ala Met Lys Asn Val Gly Gly He Pro Ala 
605 610 615 

TTG TTG CGA CTG TTG AGA AAA TCT ATT GAT GCA GAA GTA AGG GAG CTT 2043 
Leu Leu Arg Leu Leu Arg Lys Ser He Asp Ala Glu Val Arg Glu Leu 
620 625 630 

S^7 SS'^ S^T ^'^^ ^^'^ '^^^ "^^^ TGT GAT GCT GTA AAA ATG 2091 

Val Thr Gly Val Leu Trp Asn Leu Ser Ser Cys Asp Ala Val Lys Met 
635 640 645 650 

ACA ATC ATT CGA GAT GCT CTC TCA ACC TTA ACA AAC ACT GTG ATT GTT 2139 
Thr He He Arg Asp Ala Leu Ser Thr Leu Thr Asn Thr Val He Val 
655 660 665 

CCA CAT TCT GGA TGG AAT AAC TCT TCT TTT GAT GAT GAT CAT AAA ATT 2187 
Pro His Ser Gly Trp Asn Asn Ser Ser Phe Asp Asp Asp His Lys He 
670 67 5 680 

AAA TTT CAG ACT TCA CTA GTT CTG CGT AAC ACG ACA GGT TGC CTA AGG 2235 
Lys Phe Gin Thr Ser Leu Val Leu Arg Asn Thr Thr Gly Cys Leu Ara 
685 690 695 - 

AAC CTC ACG TCC GCG GGG GAA GAA GCT CGG AAG CAA ATG CGG TCC TGC 2283 
Asn Leu Thr Ser Ala Gly Glu Glu Ala Arg Lys Gin Met Arg Ser Cvs 
700 705 710 

GAG GGG CTG GTA GAC TCA CTG TTG TAT GTG ATC CAC ACG TGT GTG AAC 2331 
Glu Gly Leu Val Asp Ser Leu Leu Tyr Val He His Thr Cys Val Asn 
^15 720 725 730 

ACA TCC GAT TAC GAC AGC AAG ACG GTG GAG AAC TGC GTG TGC ACC CTG 2379 
Thr Ser Asp Tyr Asp Ser Lys Thr Val Glu Asn Cys Val Cys Thr Leu 
735 740 745 

AGG AAC CTG TCC TAT CGG CTG GAG CTG GAG GTG CCC CAG GCC CGG TTA 2427 
Arg Asn Leu Ser Tyr Arg Leu Glu Leu Glu Val Pro Gin Ala Arg Leu 
750 755 760 

CTG GGA CTG AAC GAA TTG GAT GAC TTA CTA GGA AAA GAG TCT CCC AGC 2 47 5 

Leu Gly Leu Asn Glu Leu Asp Asp Leu Leu Gly Lys Glu Ser Pro Ser 
765 770 775 

AAA GAC TCT GAG CCA AGT TGC TGG GGG AAG AAG AAG AAA AAG AAA AAG 2 52 3 

78? 7?! 7^0 

AGG ACT CCG CAA GAA GAT CAA TGG GAT GGA GTT GGT CCT ATC CCA GGA 2571 
Arg Thr Pro Gin Glu Asp Gin Trp Asp Gly Val Gly Pro He Pro Glv 
795 800 805 810 

CTG TCG AAG TCC CCC AAA GGG GTT GAG ATG CTG TGG CAC CCA TCG GTG 2619 
Leu Ser Lys Ser Pro Lys Gly Val Glu Met Leu Trp His Pro Ser Val 
815 820 825 

GTA AAA CCA TAT CTG ACT CTT CTA GCA GAA AGT TCC AAC CCA GCC ACC 2667 
Val Lys Pro Tvr Leu Thr Leu Leu Ala Glu Ser Ser Asn Pro Ala Thr 
830 835 840 

TTG GAA GGC TCT GCA GGG TCT CTC CAG AAC CTC TCT GCT AGC AAC TGG 2715 
Leu Glu Gly Ser Ala Gly Ser Leu Gin Asn Leu Ser Ala Ser Asn Trp 
845 850 855 

AAG TTT GCA GCA TAT ATC CGG GGC GGC CGT CCG AAA AGA AAA GGG CTC 2763 
Lys Phe Ala Ala Tyr He Arg Gly Gly Arg Pro Lys Arg Lys Gly Leu - 
8 60 865 870 

CCC ATC CTT GTG GAG CTT CTG AGA ATG GAT AAC GAT AGA GTT GTT TCT 2811 
Pro He Leu Val Glu Leu Leu Arg Met Asp Asn Asp Arg Val Val Ser 
fi75 880 885 890 

TCC GGT GCA ACA GCC TTG AGG AAT ATG GCA CTA GAT GTT CGC AAC AAG 2859 
Ser Gly Ala Thr Ala Leu Arg Asn Met Ala Leu Asp Val Arg Asn Lys 
B95 900 905 

GAG CTC ATA GGC AAA TAC GCC ATG CGA GAC CTG GTC AAC CGG CTC CCC 2907 
Glu Leu He Gig Lys Tyr Ala Met Ar^ Asp Leu Val Asn Arg Leu Pro 



92C 



GGC GGC AAT GGC CCC AGT GTC TTG TCT GAT GAG ACC ATG GCA GCC ATC 2955 

Gly Gly Asn Gly Pro Ser Val Leu Ser Asp Glu Thr Met Ala Ala He 

925 930 935 

r^^ l^l 9'^^ 9.^^ '^GC AAA AAC ATG GAG AAC GCA AAA 3003 

Cys Cys Ala Leu His Glu Val Thr Ser Lys Asn Met Glu Asn Ala Lys 

940 945 950 ' 
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III III Hi Ill -'?-]lS §1? J- ... 30.: 

^.r. i^? Ill III sil m tn k° 

- c.. .cc j:: ... 

i?^V^- - ;JL- - ?^L^^°°"- 

^l^^ CO, .CC „0 .C. jcT^CC ..C C.. 3.3 

U^r lil ni lllltt I- 51$ - isTi^J if? ifj 3as: 



GAATGAATGA AGTGTGTTTT TTTTTCCTTT TTGAGGAATT ATCAGGGGAA TTCGATATCA 
AGCTTATCGA TACCGTCGAC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID 1)0:6: 
Met Pro Ala Pro Glu Gin Ala Ser Leu Val Glu Glu Gly Gin P.o Gin 
Thr Arg Gin Glu Ala Ala Ser Thr Glj Pro Gly Met Glu Pro Gl'! Thr 
Thr Ala Thr Thr He Leu Ala Ser Val Lys Glu Gin Glu Vel Gin Phe 
Gin Arg Leu Thr Arg Glu Leu Glu Val Glu Arg Gin II! Val Ala Ser 



til III -2 CGC TCT^GAA TAC GAT AGG ACC^CAG 3333 

^^o^ 21? l^.V.l 5- «C CAA GGG GAT GCC ACA CAT, LTgGC 333, 

^» 1080 

£is Hi ss;^?;; itt ir. k; f^p ?n ;;s nun i« ;;j 3.„ 

!!s,'" J" sfi iJs.JJ: Js; £is iff jj"!?; st; k; 
f{L?" "I tit ss; jj? 

ns ?;j in in in it: Ki tj;;;; js: j.j i;: j.^ j;/ 
til sri s;: si^s?: m m js? j;^ „, jr.,. 3.., 

i-»-35 1X60 
?g?^$S? i^sl m 51? 15^7^1 iff S-^CGA CCT TCT 3.. 

ir. llltit ^fs^i- iff i^i^n^if A^^^ 
t^i its its sfgV- ifi III III u^^ui III m 

AGG TGA AAAGTCCATC TTGCTGATTT CATGATTGAA ATGTGAAAGT GAAGTGGAAg" 



3771 



3827 

3867 
3907 
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Gin Leu Glu Arg Cys Arq Leu Gly Ala Glu Ser Pro Ser lie Ala Ser 
65 70 75 80 

Thr Ser Ser Thr Glu Lys Ser Phe Pro Trp Arg Ser Thr Asp Val Pro 
85 90 ^ 95 

Asn Thr Gly val Ser Lys Pro Arg Val Ser Asp Ala Val Gin Pro Asn 
100 105 110 

Asn Tyr Leu lie Arg Thr Glu Pro Glu Gin Gly Thr Leu Tyr Ser Pro 
115 120 125 

Glu Gin Thr Ser Leu His Glu Ser Glu Gly Ser Leu Gly Asn Ser Aro 
130 135 140 

Ser Ser Thr Gin Met Asn Ser Tyr Ser Asp Ser Gly Tyr Gin Glu Ala 
145 150 155 160 

Gly Ser Phe His Asn Ser Gin Asn Val Ser Lys Ala Asp Asn Arg Gin 
165 170 175 

Gin His Ser Phe lie Gly Ser Thr Asn Asn His Val Val Arg Asn Ser 
190 185 190 

Arg Ala Glu Gly Gin Thr Leu Val Gin Pro Ser Val Ala Asn Arg Ala 
195 200 205 

Met Arg Arg Val Ser Ser Val Pro Ser Arg Ala Gin Ser Pro Ser Tvr 
210 215 220 

Val lie Ser Thr Gly Val Ser Pro Ser Arg Gly Ser Leu Arc Thr Ser 
225 230 23^ 240 

Leu Gly Ser Gly Phe Gly Ser Pro Ser Val Thr Asp Pro Arg Pro Leu 
245 250 255 

Asn Pro Ser Ala Tyr Ser Ser Thr Thr Leu Pro Ala Ala Arg Ala Ala 
260 265 270 

Ser Pro Tyr Ser Gin Arg Pro Ala Ser Pro Thr Ala lie Arg Arg lie 
275 2B0 285 

Gly Ser Val Thr Ser Arg Gin Thr Ser Asn Pro Asn Gly Pro Thr Pro 
290 295 300 

Gin Tyr Gin Thr Thr Ala Arg Val Gly Ser Pro Leu Thr Leu Thr Asp 
305 310 315 320 

Ala Gin Thr Arg Val Ala Ser Pro Ser Gin Gly Gin Val Gly Ser Ser 
325 330 335 

Ser Pro Lys Arg Ser Gly Met Thr Ala Val Pro Gin His Leu Gly Pro 
340 345 350 

Ser Leu Gin Arg Thr Val His Asp Met Glu Gin Phe Gly Gin Gin Gin 
355 360 365 

Tyr Asp lie Tyr Glu Arg Met Val Pro Pro Arg Pro Asp Ser Leu Thr 
370 375 380 

Gly Leu Arg Ser Ser Tvr Ala Ser Gin His Ser Gin Leu Gly Gin Asp 
385 390 395 400 

Leu Arg Ser Ala Val Ser Pro Asp Leu His lie Thr Pro lie Tyr Glu 
405 410 415 

Gly Arg Thr Tyr Tyr. Ser Pro Val Tyr Arg Ser Pro Asn His Gly Thr 
420 425 430 

Val Glu Leu Gin Gly Ser Gin Thr Ala Leu Tyr Arg Thr Gly Val Ser 
-435 440 445 

Gly He Gly Asn Leu Gin Arg Thr Ser Ser Gin Arg Ser Thr Leu Thr 
450 455 460 

Tyr Gin Arg Asn Asn Tyr Ala Leu Asn Thr Thr Ala Thr Tyr Ala Glu 
465 4^0 475 480 

Pro Tyr Arg Pro He Gin Tyr Arg Val Gin Glu Cys Asn Tyr Asn Arg 
485 490 495 

Leu Gin His Ala Val Pro Ala Asp Asp Gly Thr Thr Arg Ser Pro Ser 
500 505 510 

lie Asp Ser He Gin Lys Asp Pro Arg Glu Phe Ala Trp Arg Asp Pro 
515 520 525 

Glu Leu Pro Glu Val He His Met Leu Glu His Gin Phe Pro Ser Val 
530 ' 535 540 

Gin Ala Asn Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn 
545 550 555 ^ f 

Lys Val Lys Met Glu Val Cys Arg Leu Gly Gly He Lys His Leu Val 
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565 570 575 

Asp Leu Leu Asp His Arg Val Leu Glu Val Gin Lys Asn Ala Cys Glv 
580 585 590 

Ala Leu Arg Asn Leu Val Phe Gly Lys Ser Thr Asp Glu Asn Lys He 
595 600 605 

Ala Met Lys Asn Val Gly. Gly lie Pro Ala Leu Leu Arg Leu Leu Arg 
610 615 620 

Lys Ser He Asp Ala Glu Val Arg Glu Leu Val Thr Gly Val Leu Trp 
625 630 635 640 

Asn Leu Ser Ser Cys Asp Ala Val Lys Met Thr He He Arg Asp Ala 
645 650 655 

Leu Ser Thr Leu Thr Asn Thr Val He Val Pro His Ser Gly Trp Asn 
660 665 670 

Asn Ser Ser Phe Asp Asp Asp His Lys He Lys Phe Gin Thr Ser Leu 
675 680 685 

Val Leu Arg Asn Thr Thr Gly Cys Leu Arg Asn Leu Thr Ser Ala Glv 
690 695 700 

Glu Glu Ala Arg Lys Gin Met Arg Ser Cys Glu Gly Leu Val Asp Ser 
705 710 715 720 

Leu Leu Tyr Val He His Thr Cys Val Asn Thr Ser Asp Tyr J^sp Ser 
725 730 ' 735 

Lys Thr Val Glu Asn Cys Val Cys Thr Leu Arg Asn Leu Ser Tyr Arg 
740 745 750 

Leu Glu Leu Glu Val Pro Gin Ala Arg Leu Leu Gly Leu Asn Glu Leu 
755 760 765 

Asp Asp Leu Leu Gly Lys Glu Ser Pro Ser Lys Asp Ser Glu Pro Ser 
770 775 780 

CVS Trp Gly Lys Lys Lvs Lys Lys Lys Lys Arg Thr Pro Gin Glu Asp 
785 790 795 BOO 

Gin Trp Asp Gly Val Gly Pro He Pro Gly Leu Ser Lys Ser Pro Lys 
805 8l6 815 

Gly Val Glu Met Leu Trp His Pro Ser Val Val Lys Pro Tyr Leu Thr 
820 825 830 

Leu Leu Ala Glu Ser Ser Asn Pro Ala Thr Leu Glu Gly Ser Ala Gly 
835 840 845 

Ser Asn Trp Lys Phe Ala Ala Tyr He 
850 855 860 

Arg Gly Gly Arg Pro Lys Arg Lys Gly Leu Pro He Leu Val Glu Leu 
865 - 8^0 875 880 

Leu Arg Met Asp Asn Asp Arg Val Val Ser Ser Gly Ala Thr Ala Leu 
885 890 895 

Arg Asn Met Ala Leu Asp Val Arg Asn Lys Glu Leu He Gly Lys Tyr 
900 905 910 

Ala Met Arg Asp Leu Val Asn Arg Leu Pro Gly Gly Asn Gly Pro Ser 
915 920 925 

Val Leu Ser Asp Glu Thr Met Ala Ala He Cys Cys Ala Leu His Glu 
'JO 935 940 

Val Thr Ser Lys Asn Met Glu Asn Ala Lys Ala Leu Ala Asp Ser Gly 
950 955 9gg 

Gly He Glu Lys Leu Val Asn He Thr Lys Gly Arg Gly Asp Arg Ser 
965 970 975 

Ser Leu Lys Val Val Lys Ala Ala Ala Gin Val Leu Asn Thr Leu Trp ' 
980 985 990 

Gin Tyr Ara Asp Leu Arg Ser He Tyr Lys Lys Asp Gly Trp Asn Gin 

1000 1005 

1010^^* ^^"^ 1015^*'' '^^'^ 10?0*'^ ^'''^ 

Ser^His Pro Ser Leu |er^Thr Thr Asn Gin Gln^Met Ser Pro He He 

Gin Ser Val Gly Ser Thr Ser Ser Ser Pro Ala Leu Leu Gly He Arq 
1045 1050 1055 

Asp Pro Arg Ser Glu Tyr Asp Arg Thr Gin Pro Pro Met Gin Tyr Tyr 
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Asn Ser Gin Gly Asp Ala Thr His Lys Gly Leu Tyr Pro Gly Ser Ser 
1075 1080 1085 

fJS^^^"^ "^y*^ HSc^®^ '^y^ Ser Ser Pro Ala Arg Glu 

1090 1095 1100 

Gin Asn Arg Arg Leu Gin His Gin Gin Leu Tyr Tyr Ser Gin Asp Asp 
1105 _ 1110 lll5 1120 

Ser Asn Arg Lys Asn Phe Asp Ala Tyr Arg Leu Tyr Leu Gin Ser Pro 
1125 1130 1135 

His Ser Tyr Glu Asp Pro Tyr Phe Asp Asp Arg Val His Phe Pro Ala 
1140 1145 1150 

Ser Thr Asp Tyr Ser Thr Gin Tyr Gly Leu Lys Ser Thr Thr Asn Tvr 
1155 1160 1165 

"^y^ '^^^ Ser Tyr Arg Ala Glu Gin Tvr 

1170 1175 1180 

Pro Gly Ser Pro Asp Ser Trp Val Tyr Asp Gin Asp Ala Gin Gin Arg 
1195 1190 1195 1200 

Asn Ser Phe Phe Leu Thr Leu Phe Arg Leu Arg 
1205 1210 

(2) INFORMATION FOR SEQ ID N0:7; 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 970 base pairs 
IB) TYPE: nucleic acid 

(C) STRAKDEDNESS : single 

(D) TOPOLOGY: linear 



iin) FEATURE: 

(A) NAME/KEY: misc feature 
(B> LOCATION: 1..970 

(D) OTHER INFORMATION: /note- "Y2H9- 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 



GAATTCCCAC 


AGATACCACT 


GCTGCTCCCG 


CCCTTTCGCT 


CCTCGGCCGC 


GCAATGGGCA 


60 


CCCGCGACGA 


CGAGTACGAC 


TACCTCTTTA 


AAGTTGTCCT 


TATTGGAGAT 


TCTGGTGTTG 


120 


GAAAGAGTAA 


TCTCCTGTCT 


CGATTTACTC 


GAAATGAGTT 


TAATCTGGAA 


AGCAAGAGCA 


180 


CCATTGGAGT 


AGAGTTTGCA 


ACAAGAAGCA 


TCCAGGTTGA 


TGGAAAAACA 


ATAAAGGCAC 


240 


AGATATGGGA 


CACAGCAGGG 


CAAGAGCGAT 


ATCGAGCTAT 


AACATCAGCA 


TATTATCGTG 


300 


GAGCTGTAGG 


TGCCTTATTG 


GTTTATGACA 


TTGCTAAACA 


TCTCACATAT 


GAAAATGTAG 


360 


AGCGATGGCT 


GAAAGAACTG 


AGAGATCATG 


CTGATAGTAA 


CATTGTTATC 


ATGCTTGTGG 


420 


GCAATAAGAG 


TGATCTACGT 


CATCTCAGGG 


CAGTTCCTAC 


AGATGAAGCA 


AGAGCTTTTG 


480 


CAGAAAAGAA 


TGGTTTGTCA 


TTCATTGAAA 


CTTCGGCCCT 


AGACTCTACA 


AATGTAGAAG 


540 


CTGCTTTTCA 


GACAATTTTA 


ACAGAGATTT 


ACCGCATTGT 


TTCTCAGAAG 


CAAATGTCAG 


600 


ACAGACGCGA 


AAATGACATG 


TCTCCAAGCA 


ACAATGTGGT 


TCCTATTCAT 


GTTCCACCAA 


660 


CCACTGAAAA 


CAAGCCAAAG 


GTGCAGTGCT 


GTCAGAACAT 


CTAAGGCATT 


TCTCTTCTCC 


720 


CCTAGAAGGC 


TGTGTATAGT 


CCATTTCCCA 


GGTCTSASAT 


TTAAATATAW 


TTGTAATTCT 


780 


TGTGTCAC-TT 


TTGT6TTTTA 


TTACTTCATA 


CTTATGAATT 


TTTCCATGTC 


CTAAGTCTTT 


840 


T6ATTTTGMT 


TTATAAAATC 


ATCCACTTGT 


NCCGAATGNC 


TGCAGCTTTT 


TTTCATGCTA 


900 


TGGCTTCACT 


AGCCTTAGTT 


TNATAAACTG 


AATGTTTGGA 


TTCCTCCCCC 


CAAAAAAAAA 


960 


AAAACTCGAG 












970 


(2) INFORMATION FOR SEQ ID NO: 8 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDCONESS: Single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 2? 4 

(D) OTHER INFORMATION: /note- *'Y2H23b** 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



(xi) SEQUENCE DESCRIPTION: SEO Id'^NOiB: 

GAATTCGCGG CCGNGTCGAC CCCCCACCCC CGATGCCACC ACCCCCANTG GGNTCTCCCN 

NCCCA6TCAT CAGTTCTTCC ATGGNGTNCC CTGGTCTGCC CCCTCCAGCT CCCCCAGGCN 

TTCTCCGGGT CTGNCAGCAG CCNCCAGATT AACTCAACAG TGTCACTCCC TGGGGGTGGG 

TCTGGNCCCC CTGANGATGT GAAGCCACCA GTCTNAGNGG TCCGGGGTCT GTACTGTCCA 
CCCCCTCCAG GTGGACCTGG CGCT 
(2) INFORMATION FOR SEQ ID N0:9: 

ti) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 3T0 

(D) OTHER INFORMATION: /note- *'Y2H27" 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GAATTCGCGG CCGCGTCGAC CGCGGTCGCG TCGACCTGTT GCCCAGGCCC TAGAGGTCAT 
TCCTCGTACC CTGATCCAGA ACTGTGGGGC CAGCACCATC CGTCTACTTA CCTCCCTTCG 
GGCCAAGCAC ACCCAGGAGA ACTGTGAGAC CTGGGGTGTA AATGGTGAGA CGGGTACTTT 
GGTGGACATG AAGGAACTGG GCATATGGGA GCCATTGGCT GTGAAGCTGC AGACTTATAA 
GACAGCAGTG GAGACGGCAG TTCTGCTACT GCGAATTGAT GACATCGTTT CAGGCCACAA 
AAAGAAAGGC GATGACCAGA GCCGGCAAGG CGGNGCTCCT 
(2) INFORMATION FOR SEQ ID N0:10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 404 base pairs 

(B) TYPE: nucleic aciS 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 404 

(D) OTHER INFORMATION: /note- "Y2H35" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:I0; 
GAATTCGCGG TCGCGTCGAC GGTTAGTCCC ACTGGNCGCA TCGAGGGNTT CACCAACGTC 
ATGGAGCTGT ATGGCANGAT CGCCGAGGTC TTCCNCCTGC CAACTGCCGA GGTGATGTTC 
TGCACCCTGA NCACCCACAA AGTGGACATN GACAAGCTCC TGGGGGGCCA GATCGGGCTG 
GAGGACTTCA TCTTCGCCCA CGTGAAGGGG YAGCGCAAGG AGGTGGAGGT GTTCAWGTCG 
GAGGATGYAC TCGGKCTCAC CATCACGGAC AACGGGGCTG GCTACGCTTC CATCAAGCGC 
ATCAAGGAGG GCAGCGTGAT CGACCACATC CACCTCATCA GCGTGGGCGA CATGATCGAG 
GCCATTAACG GGCAGAGCTT CCTGGGCTGC CGGCATTACG AGGT 
(2) INFORMATION FOR SEQ ID N0:11: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(A) NAME/KEY: misc feature 
tB) LOCATION: 1..350 

(D) OTHER INFORMATION: /note- '•Y2H171- 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
GAATTCGCGG CCGCGTCGAC AAAAAAAGTA AAAGG AACTC ^ GGC AAATC XT ACCCCGCCTG 
TTTACCAAAA ACATCACCTC TAGCATCACC AGTATTAGAG GCACCGCCTG CCCAGTGACA 



60 
120 
180 
240 
264 



60 
120 
ISO 
240 
300 
340 



60 
120 
180 
240 
300 
360 
404 



60 
120 
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CATGTTTAAC GGCCGCGGTA CCCTAACCGT GCAAAGGTAG CATAATCACT TGTTCCTTAA 180 

GTAGGGACCT GTATGAATGG CTCCACGAGG GTTCAGCTGT CTCTTACTTT TAACCARTGA 240 

AATTGACCTG CCCGTGAAGA GGCGGGCATG ACACAGCAAG ACGAGAAGAC CCTATGGAGC 300 

TT7AATTTAT TAATGCAAAC AGTACCTAAC AAACCCACAG GGTCCTAAAC 350 
(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNES5 : single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B ) LOCATION: 1 . . 350 

(D) OTHER INFORMATION: /note- ••Y2H41- 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 

GAATTCGCGG NCGCGTCGAC AGATAATGAA AAAACCAGAG GTTCCCTTCT TTGGTCCCCT 60 

NNNNGATGGT GCTATTGTGA ATGGAAAGGT TCTACCCATT ATGGTTAGAG CAACAGCTAT 120 

AAATGCAAGC CGTGCTCTGA AATCTCTGAT TCCATTGTAT CAAAACTTCT ATGAGGAGAG 180 

AGCACGATAC CTGCAAACAA TTGTCCAGCA CCACTTAGAA CCAACAACAT TTGAAGATTT 240 

TGNAGCACAG GTTTTTTCTC CAGCTCCCTA CCACCATTTA CCATCTGATG CCGTTGGCTC 300 

CTACCCAGAG ATTCTACCCA GTGAAAACTC CCACAGCAAC GCAGGTAGGA 350 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding at 

least a presenilin-interacting domain of a presenilin-interacting protein selected from 
the group consisting of a mammalian S5a (qjproximately residues 70-377 of SEQ ID 
NO: 2), GT24 (approximately residues 346-862 of SEQ ID NO: 4), p0071 
5 (approximately residues 509-1022 of SEQ ID NO: 6). Rabll (SEQ ID NO: 7), 
retinoid X receptor-p (SEQ ID N0:8). cytoplasmic chaperonin (SEQ ID NO: 9), 
Y2H35 (SEQ ID NO: 10), Y2H171 (SEQ ID NO: 11), and a Y2H41 (SEQ ID NO: 
12) presenilin-interacting domain. 

0 2, An isolated nucleic acid comprising a nucleotide sequence of at least 1 0 

consecutive nucleotides selected from the group consisting of SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 
10, SEQ ID NO: 1 1, SEQ ID NO: 12. GenBank Accession Numbers F08730, T18858, 
X81889, X56740, X53143, M84820, X63522, M81766. U17104, X74801, R12984, 

> D55326, and T64843, and a sequence complementary to any of these sequences. 



3. " An isolated nucleic acid as in claim 2 comprising a nucleotide sequence of 
at least 15 consecutive nucleotides selected from said group. 



4. An isolated nucleic acid as in claim 2 comprising a nucleotide sequence of 

at least 20 consecutive nucleotides selected from said group. 
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5. An isolated nucleic acid comprising a nucleotide sequence encoding an 
antigenic determinant of a presenilin-interacting protein selected from the group 
consisting of a mammalian S5a, GT24, p0071, Rabll, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 

5 

6. A method for identifying allelic variants or heterospecific homologues of a 
hiunan presenilin-interacting protein gene comprising 

choosing a nucleic acid probe or primer capable of hybridizing to a human 
presenilin-interacting protein gene sequence under stringent hybridization conditions; 
10 mixing said probe or primer with a sample of nucleic acids which may 

contain a nucleic acid corresponding to said variant or homologue; 

detecting hybridization of said probe or primer to said nucleic acid 
corresponding to said variant or homologue. 

15 7. A method as in claim 6 wherein said sample comprises a sample of nucleic 

acids selected from the group consisting of human genomic DNA, human mRNA, and 
human cDN A. 

8. A method as in claim 6 wherein said sample comprises a sample of nucleic 
20 acids selected from the group consisting of mammalian genomic DNA, mammalian 

mRNA, and mammalian cDNA. 

9. A method as in claim 6 wherein said sample comprises a sample of nucleic 
acids selected fit)m the group consisting of invertebrate genomic DNA, invertebrate 

25 mRNA, and invertebrate cDNA. 
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10. A method as in claim 6 further comprising the step of isolating said nucleic 

acid corresponding to said variant or homologue. 



5 11. A method as in claim 6 wherein said nucleic acid is identified by 

hybridization. 



12. A method as in claim 6 wherein said nucleic acid is identified by PGR 

amplification. 



10 



13. A method for identifying allelic variants or heterospecific homologues of a 

human presenilin-interacting protein gene comprising 

choosing an antibody capable of selectively binding to a human 
presenilin-interacting protein; 
^5 mixing said antibody with a sample of proteins which may contain a 

protein corresponding to said variant or homologue; 

detecting binding of said antibody to said protein corresponding to said 
variant or homologue. 



20 1 4. A method as in claim 1 3 wherein said sample comprises a sample of 

proteins selected bom the group consisting of human proteins, human fusion proteins, 
and proteolytic firagments thereof 
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15. A method as in claim 13 wherein said sample comprises a sample of 

proteins selected from the group consisting of mammalian proteins, mammalian 
fusion proteins, and proteolytic fragments thereof. 

5 16. A method as in claim 13 wherein said sample comprises a sample of 

proteins selected from the group consisting of invertebrate proteins, invertebrate 
fusion proteins, and proteolytic fragments thereof. 

17. A method as in claim 13 further comprising the step of substantially 
10 piuifying said protein corresponding to said variant or homologue. 

18. An isolated nucleic acid comprising an allelic variant or a heterospecific 
homologue of a human presenilin*interacting protein gene. 

15 19. An isolated nucleic acid encoding an allelic variant or heterospecific 

homologue of a human presenilin-interacting protein. 

20. An isolated nucleic acid comprising a recombinant vector including a 
nucleotide sequence of any one of claims 1-19. 

20 

21. An isolated nucleic acid as in claim 20 wherein said vector is an 
expression vector and said presenilin-interacting protein nucleotide sequence is 
operably joined to a regulatory region. 
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22. An isolated nucleic acid as in claim 21 wherein said expression vector may 

express said presenilin-interacting protein sequence in mammalian cells. 



23. An isolated nucleic acid as in claim 22 wherein said cells are selected fix)m 

the group consisting of fibroblast, liver, kidney, spleen, bone marrow and neurological 
cells. 



24. An isolated nucleic acid as in claim 21 wherein said vector is selected from 

10 the group consisting of vaccinia virus, adenovirus, retrovirus, neurotropic viruses and 
Herpes simplex. 



25. An isolated nucleic acid as in claim 21 wherein said expression vector 

encodes at least a presenilin-interacting domain of a presenilin-interacting protein 
15 selected from the group consisting of a mammalian S5a, GT24, p0071 , Rabl 1 , 

retinoid X receptor-P, cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 



26. An isolated nucleic acid as in claim 2 1 wherein said vector further 

comprises sequences encoding an exogenous protein operably joined to said 
20 presenilin-interacting protein sequence and whereby said vector encodes a presenilin- 
interacting protein fusion protein. 



27. An isolated nucleic acid as in claim 26 wherein said exogenous protein is 

selected from the group consisting of lacZ, trpE, maltose-binding protein, a poly-His 
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tag, glutathione-S-transferase, a GAL4-DNA binding domain, and a GAL4 activation 
domain. 

28. An isolated nucleic acid comprising a recombinant expression vector 

5 including nucleotide sequences corresponding to an endogenous regulatory region of a 
presenilin-interacting protein gene. 

29. An isolated nucleic acid as in claim 28 wherein said endogenous 
regulatory region is operably joined to a marker gene. 

10 

30. A host cell transformed with an expression vector of any one of claims 20- 
29, or a descendant thereof. 

31. A host cell as in claim 30 wherein said host cell is selected from the group 
15 consisting of bacterial cells and yeast cells. 

32. A host cell as in claim 30 wherein said host cell is selected from the group 
consisting of fetal cells, embryonic stem cells, zygotes, gametes, and germ line cells. 

20 33. A host cell as in claim 30 wherein said cell is selected from the group 

consisting of fibroblast, liver, kidney, spleen, bone marrow and neurological cells. 
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A host cell as in claim 30 wherein said cell is an invertebrate cell. 



35. A non-human animal model for Alzheimer's Disease, wherein a genome of 
said animal, or an ancestor thereof, has been modified by at least one recombinant 
construct, and wherein said recombinant construct has introduced a modification 
selected fiom the group consisting of (1) insertion of nucleotide sequences encoding 
at least a functional domain of a heterospecific normal presenilin-interacting protein, 
(2) insertion of nucleotide sequences encoding at least a functional domain of a 
heterospecific mutant presenilin-interacting protein, (3) insertion of nucleotide 
sequences encoding at least a functional domain of a conspecific homologue of a 
heterospecific mutant presenilin-interacting protein, and (4) inactivation of an 
endogenous presenilin-interacting protein gene. 



36. An animal as in claim 35 wherein said modification is insertion of a 

nucleotide sequence encoding at least a functional domain of a normal human 
presenilin-interacting protein selected from the group consisting of a mammalian S5a, 
GT24. p0071, Rabl 1, retinoid X receptor-P, cytoplasmic chaperonin, Y2H35, 
Y2H171. and Y2H41 protein. 



37. An animal as in claim 35 wherein said modification is insertion of a 

nucleotide sequence encoding at least a fimctional domain of a mutant human 
presenilin-interacting protein selected from the group consisting of a mammalian S5a, 
GT24, p0071. Rabl 1, retinoid X receptor-p, cytoplasmic chaperonin, Y2H35, 
Y2H171. and Y2H41 protein. 
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38. An animal as in claim 35 wherein said animal is selected from the group 

consisting of rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, 
and non-human primates. 



39. An animal as in claim 35 wherein said animal is an invertebrate. 



10 



40. A method for producing at least a functional domain of a presenilin- 

interacting protein comprising culturing a host cell of any of claims 30-34 under 
suitable conditions to produce said presenilin by expressing said nucleic acid. 



41 . A substantially pure preparation of a protein selected from the group 

consisting of a mammalian S5a. GT24, p0071, Rabl 1, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 



15 42. A substantially pure preparation of a polypeptide comprising an amino 

acid sequence of at least 10 consecutive amino acid residues selected from the group 
consisting SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and GenBank Accession 
Numbers F08730, T18858, X81889, X56740, X53143, M84820, X63522, M81766, 
U17104, X74801, R12984;D55326, and T64843. 

20 

43. A substantially pure preparation of a polypeptide as in claim 42 

comprising an amino acid sequence of at least 15 consecutive amino acid residues 
, selected from said group. 
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44. A substantially pure preparation of a polypeptide comprising at least a 
presenilin-interacting domain of a presenilin-interacting protein selected from the 
group consisting of a mammalian S5a, GT24, p0071, RabI I, retinoid X receptor-p. 
5 cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 



45. A substantially pure preparation of a polypeptide comprising an antigenic 

determinant of a presenilin-interacting protein selected from the group consisting of a 
mammalian S5a, GT24, pO071. Rabll. retinoid X receptor-p, cytoplasmic 
10 chaperonin, Y2H35, Y2H1 71 , and Y2H41 protein. 



46. A method of producing antibodies which selectively bind to a presenilin- 

interacting protein comprising the steps of 

administering an immunogenically effective amount of a presenilin- 
15 interacting protein inununogen to an animal; 

allowing said animal to produce antibodies to said immunogen; and 
obtaining said anribodies from said animal or from a cell culture derived 

therefrom. 



20 47. A substantially pure preparation ofan antibody which selectively binds to 

an antigenic determinant of a presenilin-interacting protein selected from the group 
consisting of a mammalian S5a, GT24, p007I, Rabl 1, retinoid X receptor-p. 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H4I protein. 
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48. A substantially pure preparation of an antibody as in claim 47 wherein said 

antibody selectively binds to an antigenic detenninant of a mutant presenilin- 
interacting protein and fails to bind to a normal presenilin-interacting protein. 

5 49. A cell line producing an antibody of any one of claims 47-48. 

50. A method for identifying compounds which can modulate the expression 
of a presenilin-interacting protein gene comprising 

contacting a cell with a test candidate wherein said cell includes a 
10 regulatory region of a presenilin-interacting protein gene operably joined to a coding 
region; and 

detecting a change in expression of said coding region. 

51. A method as in claim 50 wherein said change comprises a change in a 
15 level of an mRNA transcript encoded by said coding region. 

52. A method as in claim 50 wherein said change comprises a change in a 
level of a protein encoded by said coding region. 

20 53. A method as in claim 50 wherein said change is a result of an activity of a 

protein encoded by said coding region. 
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54. A method as in claim 50 wherein said coding region encodes a marker 

protein selected from the group consisting of p-galactosidase, alkaline phosphatase, 
green fluorescent protein, and luciferase. 



5 55. A method for identifying compounds which can selectively bind to a 
presenilin-interacting protein comprising the steps of 

providing a preparation including at least one presenilin-interacting protein 
component; 

contacting said preparation with a sample including at least one candidate 
10 compound; and 

detecting binding of said presenilin-interacting protein component to said 
candidate compound. 



56. The method in 55 wherein said binding to said presenilin-interacting 
component is detected by an assay selected from the group consisting of: afTinity 
chromatography, co-immunoprecipitation, a Biomolecular Interaction Assay, and 
yeast two-hybrid system. 



57. A method of identifying compounds which can modulate activity of a 

20 presenilin-interacting protein comprising the steps of 

providing a cell expressing a normal or mutant presenilin-interacting 
protein gene; 

contacting said cell with at least one candidate compound; and 
detecting a change in a marker of said activity. 

25 
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58. A method as in claim 57 wherein measurement of said marker indicates a 
difference between cells bearing an expressed mutant presenilin-interacting protein 
gene and otherwise identical cells free of an expressed mutant presenilin-interacting 
protein gene. 

5 

59. A method as in claim 57 wherein said change comprises a change in a non- 
specific marker of cell physiology selected from the group consisting of pH; 
intracellular Ca"^, Na\ or K*; cyclic AMP levels; GTP/GDP ratios; 
phosphatidylinositol activity; and protein phosphorylation. 

10 

60. A method as in claim 57 wherein said change comprises a change in 
expression of said presenilin-interacting protein. 

61 . A method as in claim 57 wherein said change comprises a change in 

15 intracellular concentration or flux of an ion selected from the group consisting of Ca^\ 
Na*andK\ 

62. A method as in claim 57 wherein said change comprises a change in 
occurrence or rate of apoptosis or cell death. 

20 

63. A method as in claim 57 wherein said change comprises a change in 
production of AP peptides. 
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64. A method as in claim 57 wherein said change comprises a change in 
phosphorylation of at least one microtubule associated protein. 



65. A method as in claim 57 wherein said cell is a cell cultured in vitro 



66. A method as in claim 65 wherein said cell is a transformed host cell of any 

one of claims 30-34. 



67. A method as in claim 65 wherein said cell is explanted from a host bearing 
10 at least one mutant presenilin-interacting protein gene. 



68. A method as in claim 65 wherein said cell is explanted from a transgenic 

animal of any one of claims 35-39. 



15 69. A method as in claim 57 wherein said cell is a cell in a live animal. 



70. A method as in claim 69 wherein said cell is a cell of a transgenic animal 

of any one of claims 35-39. 



20 71. A. method as in claim 57 wherein said cell is in a human subject in a 

clinical trial. 
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72. A diagnostic method for detennining if a subject bears a mutant presenilin- 
interacting protein gene comprising the steps of 

providing a biological sample of said subject; 
5 detecting in said sample a mutant presenilin-interacting protein nucleic 

acid, a mutant presenilin-interacting protein, or a mutant presenilin-interacting protein 
activity. 

73. A method as in claim 72, wherein a mutant presenilin-interacting protein 
10 nucleic acid is detected by an assay selected from the group consisting of direct 

nucleotide sequencing, probe specific hybridization, restriction enzyme digest and 
mapping, PCR mapping, ligase-mediated PCR detection, RNase protection, 
electrophoretic mobility shift detection, and chemical mismatch cleavage. 

15 74. A method as in claim 72, wherein a mutant presenilin-interacting protein is 

detected by an assay selected from the group consisting of an immunoassay, a 
protease assay, and an electrophoretic mobility assay. 

75. A pharmaceutical preparation comprising a substantially pure presenilin- 
20 interacting protein and a pharmaceutically acceptable carrier. 

76. A pharmaceutical preparation comprising an expression vector operably 
encoding a presenilin-interacting protein, wherein said expression vector may express 
said presenilin-interacting protein in a human subject, and a pharmaceutically 

25 acceptable carrier. 
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77. A pharmaceutical preparation comprising an expression vector operably 

encoding a presenilin-interacting protein antisense sequence, wherein said expression 
vector may express said preseniUn-interacting protein antisense sequence in a human 
subject, and a phaimaceutically acceptable carrier. 



78. A pharmaceutical preparation comprising a substantially pure antibody, 

wherein said antibody selectively binds to a mutant presenilin-interacting protein, and 
a pharmaceutically acceptable carrier. 



79. A pharmaceutical preparation as in claim 78 wherein said preparation is 

essentially free of an antibody which selectively binds a normal presenilin-interacting 
protein. 



15 80. A pharmaceutical preparation comprising a substantially pure preparation 

of an antigenic determinant of a mutant presenilin-interacting protein. 



,81. A pharmaceutical preparation as in claim 80 wherein said preparation is 
essentially free of an antigenic determinant of a normal presenilin-interacting protein. 



20 



82. A method of treatment for a patient bearing a mutant presenilin-interacting 

protein gene comprising the step of administering to said patient a therapeutically 
effective amount of the pharmaceutical preparation of any one of claims 75-81. 
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83. A method as in claim 82, wherein said pharmaceutical preparation is 

targeted to a cell type is selected from the group consisting of heart, brain, lung, liver, 
skeletal muscle, kidney, pancreas and neurological cells. 
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